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POLYPEPTIDE-POLYMER CONJUGATES HAVING ADDED AND/OR REMOVED ATTACHMENT GROUPS 

FIELD OF THE INVENTION 

The present invention relates to polypeptide-polymer 
5 conjugates having added and/or removed one or more attachment 
groups for coupling polymeric molecules on the surface of the 3D 
structure of the polypeptide, a method for preparing polypeptide- 
polymer conjugates of the invention, the use of said conjugated 
for reducing the immunogenicity and allergenicity, and 
10 compositions comprising said conjugate. 

BACKGROUND OF THE INVENTION 

The use of polypeptides, including enzymes, in the 
circulatory system to obtain a particular physiological effect is 

15 well-known in the medical arts. Further, within the arts of 
industrial applications, such as laundry washing, textile 
bleaching, person care, contact lens cleaning, food and feed 
preparation enzymes are used as a functional ingredient. One of 
the important differences between pharmaceutical and industrial 

20 application is that for the latter type of applications (i.e. 
industrial applications) the polypeptides (often enzymes) are not 
intended to enter into the circulatory system of the body. 

Certain polypeptides and enzymes have an unsatisfactory 
stability and may under certain circumstances - dependent on the 

25 way of challenge - cause an immune response, typically an IgG 
and/ or IgE response. 

It is today generally recognized that the stability of 
polypeptides is improved and the immune response is reduced when 
polypeptides, such as enzymes, are coupled to polymeric molecules. 

30 It is believed that the reduced immune response is a result of the 
shielding of (the) epitope (s) on the surface of the polypeptide 
responsible for the immune response leading to antibody formation 
by the coupled polymeric molecules. 

Techniques for conjugating polymeric molecules to polypeptides 

3 5 are well-known in the art. 

One of the first suitable commercially techniques was described 
back in the early 1970'ies and disclosed in e.gr. US patent no. 
4,179,337. Said patent concerns non- immunogenic polypeptides, such 
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as enzymes and peptide hormones coupled to polyethylene glycol 
(PEG) or polypropylene glycol (PPG). At least 15% of polypeptides 1 
physiological activity is maintained. 

GB patent no. 1,183,257 (Crook et al.) describes chemistry for 
5 conjugation of enzymes to polysaccharides via a triazine ring. 

Further, techniques for maintaining of the enzymatic activity 
of enzyme-polymer conjugates are also known in the art. 

WO 93/15189 (Veronese et al.) concerns a method for maintaining 
the activity in polyethylene glycol-modif ied proteolytic enzymes 
10 by linking the proteolytic enzyme to a macromolecularized 
inhibitor. The conjugates are intended for medical applications. 

It has been found that the attachment of polymeric molecules to 
a polypeptide often has the effect of reducing the activity of the 
polypeptide by interfering with the interaction between the 
15 polypeptide and its substrate. EP 183 503 (Beecham Group PLC) 
discloses a development of the above concept by providing 
conjugates comprising pharmaceutically useful proteins linked to 
at least one water-soluble polymer by means of a reversible 
linking group. 

20 EP 471,125 (Kanebo) discloses skin care products comprising a 
parent protease (Bacillus protease with the trade name Esperase®) 
coupled to polysaccharides through a triazine ring to improve the 
thermal and preservation stability. The coupling technique used is 
also described in the above mentioned GB patent no. 1,183,257 

25 (Crook et al. ) . 

JP 3083908 describes a skin cosmetic material which 
contains a transglutaminase from guinea pig liver modified with 
one or more water-soluble substance such as PEG, starch, 
cellulose etc. The modification is performed by activating the 

30 polymeric molecules and coupling them to the enzyme. The 
composition is stated to be mild to the skin. 

However, it is not always possible to readily couple 
polymeric molecules to polypeptides and enzymes. Further, there is 
still a need for polypeptide-polymer conjugates with an even more 

35 reduced immunogenicity and/ or allergenicity . 

SUMMARY OP THE INVENTION 

It is the object of the present invention to provide improved 
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polypeptide-polymer conjugates suitable for industrial and 

pharmaceutical applications. 

The term "improved polypeptide-polymer conjugates 11 means in the 

context of the present invention conjugates having a reduced 
5 immune response in humans and animals and/or a improved stability. 

As will be described further below the immune response is 

dependent on the way of challenge. 

The present inventors have found that polypeptides, such as 

enzymes, may be made less immunogenic and/or allergenic by adding 
10 and/or removing one or more attachment groups on the surface of 

the parent polypeptide to be coupled to polymeric molecules. 

When introducing pharmaceutical polypeptide directly into the 

circulatory system (i.e. bloodstream) the potential risk is an 

immunogenic response in the form of mainly IgG, IgA and/or IgM 
15 antibodies. In contrast hereto, industrial polypeptides, such as 

enzymes used as a functional ingredient in e.g. detergents, are 

not intended to enter the circulatory system. The potential risk 

in connection with industrial polypeptides is inhalation causing 

an allergenic response in the form of mainly IgE antibody 
20 formation. 

Therefore, in connection with industrial polypeptides the 
potential risk is respiratory allergenicity caused by inhalation, 
intratracheal and intranasal presentation of polypeptides. 

The main potential risk of pharmaceutical polypeptides is 

2 5 immunogenicity caused by intradermally, intravenously or subcu- 

taneously presentation of the polypeptide. 

It is to be understood that reducing the "immunogenicity" 
and reducing the "respiratory allergenicity" are two very 
different problems based on different routes of exposure and on 

3 0 two very different immunological mechanisms: 

The term "immunogenicity" used in connection with the 
present invention may be referred to as allergic contact 
dermatitis in a clinical setting and is a cell mediated delayed 
immune response to chemicals that contact and penetrate the skin. 
35 This cell mediated reaction is also termed delayed contact 
hypersensitivity (type IV reaction according to Gell and Combs 
classification of immune mechanisms in tissue damage) . 

The term "allergenicity" or "respiratory allergenicity" is an 
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immediate anaphylactic reaction (type I antibody-mediated reaction 
according to Cell and Combs) following inhalation of e.g. 
polypeptides . 

According to the present invention it is possible to provide 
5 polypeptides with a reduced immune response and/ or improved 
stability, which has a substantially retained residual activity. 

The allergic and the immunogenic response are in one term, at 
least in the context of the present invention called the "immune 
response" . 

10 In the first aspect the invention relates to a polypeptide- 
polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 

15 comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 

20 of the polypeptide in comparison to the number of attachment 
groups available on the corresponding parent polypeptide. 

The term "parent polypeptide" refers to the polypeptide to be 
modified by coupling to polymeric molecules. The parent 
polypeptide may be a naturally-occurring (or wild-type) 

25 polypeptide or may be a variant thereof prepared by any suitable 
means. For instance, the parent polypeptide may be a variant of a 
naturally-occurring polypeptide which has been modified by 
substitution, deletion or truncation of one or more amino acid 
residues or by addition or insertion of one or more amino acid 

30 residues to the amino acid sequence of a naturally- occurring 
polypeptide. 

A "suitable attachment group" means in the context of the 
present invention any amino acid residue group on the surface of 
the polypeptide capable of coupling to the polymeric molecule in 
35 question. 

Preferred attachment groups are amino groups of Lysine 
residues and the N-terminal amino group. Polymeric molecules may 
also be coupled to the carboxylic acid groups (-COOH) of amino 
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acid residues in the polypeptide chain located on the surface. 
Carboxylic acid attachment groups may be the carboxylic acid group 
of Aspartate or Glutamate and the C-terminal COOH-group. 

A "functional site" means any amino acid residues and/or 
5 cofactors which are known to be essential for the performance of 
the polypeptide, such as catalytic activity, e.g. the catalytic 
triad residues, Histidine, Aspartate and Serine in Serine 
proteases, or e.gr. the heme group and the distal and proximal 
Histidines in a peroxidase such as the Arthromyces ramosus 
10 peroxidase. 

In the second aspect the invention relates to a method for 
preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
15 3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a 

20 suitable attachment group, and/or 

ii) substituting or deleting one or more amino acid residues 
selected in step b) at or close to the functional site(s), 

d) coupling polymeric molecules to the mutated polypeptide. . 

The invention also relates to the use of a conjugate of the 
25 invention and the method of the invention for reducing the 
immunogenicity of pharmaceuticals and reducing the allergenicity 
of industrial products. 

Finally the invention relates to compositions comprising a 
conjugate of the invention and further ingredients used in 
30 industrial products or pharmaceuticals. 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 shows the anti-lipase serum antibody levels after 5 
weekly immunizations with i) control ii) unmodified lipase 
35 variant, iii) lipase variant-SPEG. (X: log(serum dilution); Y 
Optical Density (490/620)). 
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It is the object of the present invention to provide improved 
polypeptide-polymer conjugates suitable for industrial and 
pharmaceutical applications • 

Even though polypeptides used for pharmaceutical applications 
5 and industrial application can be quite different the principle of 
the present invention may be tailored to the specific type of 
parent polypeptide (i.e. enzyme, hormone peptides etc.). 

The inventors of the present invention have provided improved 
polypeptide-polymer conjugates with a reduced immune response in 
10 comparison to conjugates prepared from the corresponding parent 
polypeptides. 

The present inventors have found that polypeptides, such as 
enzymes, may be made less immunogenic and/or less allergenic by 
adding one or more attachment groups on the surface of the parent 
15 polypeptide. In addition thereto the inventors have found that a 
higher percentage of maintained residual functional activity may 
be obtained by removing attachment groups at or close to the 
functional site(s). 

In the first aspect the invention relates to an improved 
2 0 polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 
comparison to the number of attachment groups available on the 

2 5 corresponding parent polypeptide, and/or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 
of the polypeptide in comparison to the number of attachment 

30 groups available on the corresponding parent polypeptide. 

Whether the attachment groups should be added and/ or removed 
depends on the specific parent polypeptide. 

a) Addition of Attachment groups 
35 There may be a need for further attachment groups on the 
polypeptide if only few attachment groups are available on the 
surface of the parent polypeptide. The addition of one or more 
attachment groups by substituting or inserting one or more amino 
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acid residues on the surface of the parent polypeptide increases 
the number of polymeric molecules which may be attached in 
comparison to the corresponding parent polypeptide. Conjugates 
with an increased number of polymeric molecules attached thereto 
5 are generally seen to have a reduced immune response in comparison 
to the corresponding conjugates having fewer polymeric molecules 
coupled thereto. 

Any available amino acid residues on the surface of the 
polypeptide, preferentially not being at or close to the 
10 functional site(s), such as the active site(s) of enzymes, may in 
principle be subject to substitution and/ or insertion to provide 
additional attachment groups. 

As will be described further below the location of the 
additional coupled polymeric molecules may be of importance for 
15 the reduction of the immune response and the percentage of 
maintained residual functional activity of the polypeptide itself. 

A conjugate of the invention may typically have from 1 to 25, 
preferentially 1 to 10 or more additional polymeric molecules 
coupled to the surface of the polypeptide in comparison to the 
20 number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 

However, the optimal number of attachment group to be added 
depends (at least partly) on the surface area (i.e. molecular 
weight) of the parent polypeptide to be shielded by the coupled 
25 polymeric molecules, and further off-course also the number of 
already available attachment groups on the parent polypeptide. 

b) Removing Attachment groups 

In the case of enzymes or other polypeptides performing their 

3 0 function by interaction with a substrate or the like, polymeric 
molecules coupled to the polypeptide might be impeded by the 
interaction between the polypeptide and its substrate or the like, 
if they are coupled at or close to the functional site(s) (i.e. 
active site of enzymes) . This will most probably cause reduced 

35 activity. 

In the case of enzymes having one or more polymeric molecules 
coupled at or close to the active site a substantial loss of 
residual enzymatic activity can be expected. Therefore, according 
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to the invention conjugates may be constructed to maintain a 

higher percentage of residual enzymatic activity in comparison to 

a corresponding conjugates prepared on the basis of the parent 

enzyme in question. This may be done by substituting and/or 
5 deleting attachment groups at or close to the active site, hereby 

increasing the substrate affinity by improving the accessibility 

of the substrate in the catalytic cleft. 

An enzyme-polymer conjugate of the invention may typically have 

from 1 to 25, preferably 1 to 10 fewer polymeric molecules coupled 
10 at or close to the active site in comparison to the number of 

polymeric molecules of a conjugate prepared on the basis of the 

corresponding parent polypeptide. 

As will be explained below "at or close to" the functional 

site(s) means that no polymeric molecule (s) should be coupled 
15 within 5 A, preferably 8 A, especially 10 A of the functional 

site(s) . 

Removal of attachment groups at or close to the functional 
site(s) of the polypeptide may advantageously be combined with 
addition of attachment groups in other parts of the surface of the 
20 polypeptide. 

The total number of attachment groups may this way be 
unchanged, increased or decreased. However the location (s) of the 
total number of attachment group (s) is (are) improved assessed by 
the reduction of the immune response and/ or percentage of 
25 maintained residual activity. Improved stability may also be 
obtained this way. 

The number of attachment groups 

Generally seen the number of attachment groups should be 
30 balanced to the molecular weight and/or surface area of the 
polypeptide. The more heavy the polypeptide is the more polymeric 
molecules should be coupled to the polypeptide to obtain 
sufficient shielding of the epitope (s) responsible for antibody 
formation. 

35 Therefore, if the parent polypeptide molecule is relatively 
light (e.g. 1 to 35 kDa) it may be advantageous to increase the 
total number of coupled polymeric molecules (outside the 
functional site(s)) to a total between 4 and 20. 
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If the parent polypeptide molecules is heavier, for instance 35 
to 60 kDa, the number of coupled polymeric molecules (outside the 
functional site(s)) may advantageously be increased to 7 to 40, 
and so on. 

5 The ratio between the molecular weight (Mw) of the polypeptide 
in question and the number of coupled polymeric molecules 
considered to be suitable by the inventors is listed below in 
Table 1. 

10 Table 1 



Molecular weight of parent 
polypeptide (M w ) kDa 


Number of polymeric 
molecules coupled to the 
polypeptide 


1 to 35 


4-20 


35 to 60 


7-40 


60 to 80 


10-50 


80 to 100 


15-70 


more than 100 


more than 20 



Reduced immune response vs> maintained residual enzvmatic a ctivity 
Especially for enzymes, in comparison to many other types of 
polypeptides, there is a conflict between reducing the immune 

15 response and maintaining a substantial residual enzymatic activity 
as the activity of enzymes are connected with interaction between 
a substrate and the active site often present as a cleft in the 
enzyme structure. 

Without being limited to any theory it is believed that the 

20 loss of enzymatic activity of enzyme-polymer conjugates might be a 
consequence of impeded access of the substrate to the active site 
in the form of spatial hindrance of the substrate by especially 
bulky and/or heavy polymeric molecules to the catalytic cleft. It 
might also, at least partly, be caused by disadvantageous minor 

25 structural changes of the 3D structure of the enzyme due to the 
stress made by the coupling of the polymeric molecules. 

Maintained residual activity 

A polypeptide-polymer conjugates of the invention has a 
30 substantially maintained functional activity. 
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A "substantially" maintained functional activity is in the 
context of the present invention defined as an activity which is 
at least between 20% and 30%, preferably between 30% and 40%, more 
preferably between 40% and 60%, better from 60% up to 80%, even 
5 better from 80% up to about 100%, in comparison to the activity of 
the conjugates prepared on the basis of corresponding parent 
polypeptides. 

In the case of polypeptide-polymer conjugates of the 
invention where no polymeric molecules are coupled at or close to 

10 the functional site(s) the residual activity may even be up to 
100% or very close thereto. If attachment group (s) of the parent 
polypeptide is (are) removed from the functional site the activity 
might even be more than 100% in comparison to modified (i.e. 
polymer coupled) parent polypeptide conjugate. 

15 Position of coupled polymeric molecules 

To obtain an optimally reduced immune response (i.e. 
immunogenic and allergenic response) the polymeric molecules 
coupled to the surface of the polypeptide in question should be 
located in a suitable distance from each other. 

20 In a preferred embodiment of the invention the parent 
polypeptide is modified in a manner whereby the polymeric 
molecules are spread broadly over the surface of the polypeptide. 
In the case of the polypeptide in question has enzymatic activity 
it is preferred to have as few as possible, especially none, 

25 polymeric molecules coupled at or close to the area of the active 
site. 

In the present context "spread broadly over the surface of the 
polypeptide" means that the available attachment groups are 
located so that the polymeric molecules shield different parts of 
30 the surface, preferable the whole or close to the whole surface 
area away from the functional site(s) , to make sure that 
epitope (s) are shielded and hereby not recognized by the immune 
system or its antibodies. 

The area of antibody-polypeptide interaction typically 
35 covers an area of 500 A 2 , as described by Sheriff et al. 

(1987), Proc. Natl. Acad. Sci. USA 84, p. 8075-8079. 500 A 2 
corresponds to a rectangular box of 25 i x 20 i or a circular 
region of radius 12.6 A. Therefore, to prevent binding of 
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antibodies to the epitope (s) to the polypeptide in question it 
is preferred to have a maximum distance between two attachment 
groups around 10 A. 

Consequently, amino acid residues which are located in excess 
5 of 10 A away from already available attachment groups are 

suitable target residues. If two or more attachment groups on the 
polypeptide are located very close to each other it will in most 
cases result in that only one polymeric molecule will be coupled. 
To ensure a minimal loss of functional activity it is preferred 

10 not to couple polymeric molecules at or close to the functional 
site(s). Said distance depends at least partly on the bulkiness of 
the polymeric molecules to be coupled, as impeded access by the 
bulky polymeric molecules to the functional site is undesired. 
Therefore, the more bulky the polymeric molecules are the longer 

15 should the distance from the functional site to the coupled 
polymeric molecules be. 

To maintain a substantial functional activity of the 
polypeptide in question attachment groups located within 5 A, 
preferred 8 A, especially 10 A from such functional site(s) 

20 should be left uncoupled and may therefore advantageously be 
removed or changed by mutation. Functional residues should 
normally not be mutated/ removed, even though they potentially 
can be the target for coupling polymeric molecules. In said 
case it may thus be advantageous to chose a coupling chemistry 

25 involving different attachment groups. 

Further, to provide a polypeptide having coupled polymeric 
molecules at (a) known epitope (s) recognizable by the immune 
system or close to said epitope (s) specific mutations at such 
sites are also considered advantageous according to the invention. 

30 If the position of the epitope (s) is (are) unknown it is 
advantageous to couple several or many polymeric molecules to the 
polypeptide. 

As also mentioned above it is preferred that said attachment 
groups are spread broadly over the surface. 

35 

The attachment group 

Virtually all ionized groups, such as the amino groups of 
Lysine residues, are located on the surface of the polypeptide 
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molecule (see for instance Thomas E. Creighton, (1993), 
"Proteins", W.H. Freeman and Company, New York). 

Therefore, the number of readily accessible attachment groups 
(e.g. amino groups) on a modified or parent polypeptide equals 
5 generally seen the number of Lysine residues in the primary 
structure of the polypeptide plus the N-terminus amino group. 

The chemistry of coupling polymeric molecules to amino groups 
are quite simple and well established in the art. Therefore, it is 
preferred to add and/or remove Lysine residues (i.e. attachment 
10 groups) to/ from the parent polypeptide in question to obtain 
improved conjugates with reduced immunogenic ity and/ or 
allergenicity and/or improved stability and/or high percentage 
maintained functional activity. 

Polymeric molecules may also be coupled to the carboxylic 
15 groups (-C00H) of amino acid residues on the surface of the 
polypeptide. Therefore, if using carboxylic groups (including the 
C- terminal group) as attachment groups addition and/ or removal of 
Aspartate and Glutamate residues may also be a suitable according 
to the invention. 

20 If using other attachment groups, such as -SH groups, they 
may be added and/ or removed analogously. 

Substitution of the amino acid residues is preferred over 
insertion, as the impact on the 3D structure of the polypeptide 
normally will be less pronounced. 

25 Preferred substitutions are conservative substitutions. In the 
case of increasing the number of attachment groups the 
substitution may advantageously be performed at a location having 
a distance of 5 A, preferred 8 A, especially 10 A from the 
functional site(s) (active site for enzymes). 

30 An example of a suitable conservative substitution to obtain 

an additional amino attachment group is a Arginine to Lysine 
substitution. Examples of conservative substitutions to obtain 
additional carboxylic attachment groups are Aspargine to 
Aspartate/ Glutamate or Glutamine to Aspartate/ Glutamate 

35 substitutions. To remove attachment groups a Lysine residue may be 
substituted with a Arginine and so on. 



The parent polypeptide 
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In the context of the present invention the term "polypeptides" 
includes proteins, peptides and/ or enzymes for pharmaceutical or 
industrial applications. Typically the polypeptides in question 
have a molecular weight in the range between about 1 to 100 kDa, 
5 often 15 kDa and 100 kDa. 

Pharmaceutical polypeptides 

The term "pharmaceutical polypeptides" is defined as polypep- 
tides, including peptides, such as peptide hormones, proteins 
10 and/or enzymes, being physiologically active when introduced into 
the circulatory system of the body of humans and/or animals. 

Pharmaceutical polypeptides are potentially immunogenic as they 
are introduced into the circulatory system. 

Examples of "pharmaceutical polypeptides" contemplated 
15 according to the invention include insulin, ACTH, glucagon, 
somatostatin, somatotropin, thymosin, parathyroid hormone, 
pigmentary hormones, somatomedin, erythropoietin, luteinizing 
hormone, chorionic gonadotropin, hypothalmic releasing factors, 
antidiuretic hormones, thyroid stimulating hormone, relaxin, 
20 interferon, thrombopoietin (TPO) and prolactin. 

Industrial polypeptides 

Polypeptides used for industrial applications often have an 
enzymatic activity. Industrial polypeptides (e.g. enzymes) are (in 
25 contrast to pharmaceutical polypeptides) not intended to be 
introduced into the circulatory system of the body. 

It is not very like that industrial polypeptides, such as 
enzymes used as ingredients in industrial compositions and/ or 
products, such as detergents and personal care products, including 
30 cosmetics, come into direct contact with the circulatory system of 
the body of humans or animals, as such enzymes (or products 
comprising such enzymes) are not injected (or the like) into the 
bloodstream. 

Therefore, in the case of the industrial polypeptide the 
35 potential risk is respiratory allergy (i.e. IgE response) as a 
consequence of inhalation to polypeptides through the respiratory 
passage. 

In the context of the present invention "industrial polypep- 
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tides" are defined as polypeptides, including peptides, proteins 
and/ or enzymes, which are not intended to be introduced into the 
circulatory system of the body of humans and/or animals. 

Examples of such polypeptides are polypeptides, especially 
5 enzymes, used in products such as detergents, household article 
products, agrochemicals, personal care products, such as skin care 
products, including cosmetics and toiletries, oral and dermal 
pharmaceuticals, composition use for processing textiles, 
compositions for hard surface cleaning, and compositions used for 
10 manufacturing food and feed etc. 

Enzymatic activity 

Pharmaceutical or industrial polypeptides exhibiting enzymatic 
activity will often belong to one of the following groups of 

15 enzymes including Oxidoreductases (E.C. 1, "Enzyme Nomenclature, 
(1992), Academic Press, Inc.), such as laccase and Superoxide 
dismutase (SOD); Transferases, (E.C. 2), such as transglutaminases 
(TGases) ; Hydrolases (E.C. 3), including proteases, especially 
subtilisins, and lipolytic enzymes; Isomerases (E.C. 5), such as 

20 Protein disulfide Isomerases (PDI) . 

Hydrolases 

Proteolytic enzymes 

Contemplated proteolytic enzymes include proteases selected 
25 from the group of Aspartic proteases, such pepsins, Cysteine 

proteases, such as Papain, Serine proteases, such as subtilisins, 

or metallo proteases, such as Neutrase®. 

Specific examples of parent proteases include PD498 (WO 

93/24623 and SEQ ID NO. 2), Savinase® (von der Osten et al., 
30 (1993), Journal of Biotechnology, 28, p. 55+, SEQ ID NO 3), 

Proteinase K (Gunkel et al., (1989), Eur. J. Biochem, 179, p. 185- 

194), Proteinase R (Samal et al, (1990), Mol. Microbiol, 4, p. 

1789-1792), Proteinase T (Samal et al., (1989), Gene, 85, p. 329- 

333), Subtilisin DY (Betzel et al. (1993), Arch. Biophys, 302, no. 
35 2, p. 499-502), Lion Y (JP 04197182-A) , Rennilase® (Available from 

Novo Nordisk A/S) , JA16 (WO 92/17576) , Alcalase® (a natural 

subtilisin Carlberg variant) (von der Osten et al., (1993), 

Journal of Biotechnology, 28, p. 55+). 
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Lipolytic enzymes 

Contemplated lipolytic enzymes include Humicola lanuginosa 

lipases, e.g. the one described in EP 258 068 and EP 305 216 (See 
5 SEQ ID NO 6 below) , Humicola insolens, a Rhizomucor miehei lipase, 

e.g. as described in EP 238 023, Absidia sp. lipolytic enzymes (WO 

96/13578), a Candida lipase, such as a C. antarctica lipase, e.g. 

the C. antarctica lipase A or B described in EP 214 761, a 

Pseudomonas lipase such as a P. alcaligenes and P. 
10 pseudoalcaligenes lipase, e.g. as described in EP 218 272, a P. 

cepacia lipase, e.g. as described in EP 331 376, a Pseudomonas sp. 

lipase as disclosed in WO 95/14783, a Bacillus lipase, e.g. a B. 

subtilis lipase (Dartois et al., (1993) Biochemica et Biophysica 

acta 1131, 253-260), a B. stearothermophilus lipase (JP 64/744992) 
15 and a B. pumilus lipase (WO 91/16422). Other types of lipolytic 

include cutinases, e.g. derived from Pseudomonas mendocina as 

described in WO 88/09367, or a cutinase derived from Fusarium 

solani pisi {e.g. described in WO 90/09446). 

2 0 Oxidoreductases 

Laccases 

Contemplated laccases include Polyporus pinisitus laccase (WO 
96/00290), Myceliophthora laccase (WO 95/33836), Schytalidium 
laccase (WO 95/338337), and Pyricularia oryzae laccase (Available 
25 from Sigma; . 

Peroxidase 

Contemplated peroxidases include B. pumilus peroxidases (WO 
91/05858), Myxococcaceae peroxidase (WO 95/11964), Coprinus 

3 0 cinereus (WO 95/10602) and Arthromyces ramosus peroxidase 

(Kunishima et al. (1994), J. Mol. Biol. 235, p. 331-344). 

Transferases 

Transglutaminases 

35 Suitable transferases include any transglutaminases disclosed 
in WO 96/06931 (Novo Nordisk A/S) and WO 96/22366 (Novo Nordisk 
A/S). 
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Isomerases 

Protein Disulfide Isomerase 

Without being limited thereto suitable protein disulfide 
isomerases include PDIs described in WO 95/01425 (Novo Nordisk 
5 A/S) • 

The polymeric molecule 

The polymeric molecules coupled to the polypeptide may be any 
suitable polymeric molecule, including natural and synthetic homo- 

10 polymers, such as polyols (i.e. poly-OH) , polyamines (i.e. poly- 
NH 2 ) and polycarboxyl acids (i.e. poly-COOH) , and further hetero- 
polymers i.e. polymers comprising one or more different coupling 
groups e.g. a hydroxy 1 group and amine groups. 

Examples of suitable polymeric molecules include polymeric 

15 molecules selected from the group comprising polyalkylene oxides 
(PAO) , such as polyalkylene glycols (PAG) , including polyethylene 
glycols (PEG) , methoxypolyethylene glycols (mPEG) and polypropylen 
glycols, PEG-glycidyl ethers (Epox-PEG) , PEG-oxycarbonylimidazole 
(CDI-PEG) , Branced PEGs, poly-vinyl alcohol (PVA) , poly- 

20 carboxylates, poly-(vinylpyrolidone) , poly-D,L-amino acids, 
polyethylene-co-maleic acid anhydride, polystyrene-co-malic acid 
anhydrid, dextrans including carboxymethyl-dextrans, heparin, 
homologous albumin, celluloses, including methylcellulose, 
carboxymethylcellulose, ethylcellulose, hydroxyethylcellulose 

25 carboxyethylcellulose and hydroxypropylcellulose, hydrolysates of 
chitosan, starches such as hydroxyethyl-straches and hydroxy 
propyl-starches, glycogen, agaroses and derivates thereof, guar 
gum, pullulan, inulin, xanthan gum, carrageenin, pectin, alginic 
acid hydrolysates and bio-polymers. 

30 Preferred polymeric molecules are non-toxic polymeric molecules 
such as (m) polyethylene glycol ( (m)PEG) which further requires a 
relatively simple chemistry for its covalently coupling to 
attachment groups on the enzyme *s surface. 

Generally seen polyalkylene oxides (PAO) , such as polyethylene 

35 oxides, such as PEG and especially mPEG, are the preferred 
polymeric molecules, as these polymeric molecules, in comparison 
to polysaccharides such as dextran, pullulan and the like, have 
few reactive groups capable of cross-linking. 
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Even though all of the above mentioned polymeric molecules may 
be used according to the invention the methoxypolyethylene glycols 
(mPEG) may advantageously be used. This arise from the fact that 
methoxyethylene glycols have only one reactive end capable of 
5 conjugating with the enzyme. Consequently, the risk of cross- 
linking is less pronounced. Further, it makes the product more 
homogeneous and the reaction of the polymeric molecules with the 
enzyme easier to control. 

10 Preparation of enzvme variants 

Enzyme variants to be conjugated may be constructed by any 
suitable method. A number of methods are well established in 
the art. For instance enzyme variants according to the 
invention may be generated using the same materials and methods 

15 described in e.g. WO 89/06279 (Novo Nordisk A/S) , EP 130,756 
(Genentech) , EP 479,870 (Novo Nordisk A/S), EP 214,435 
(Henkel) , WO 87/04461 (Amgen) , WO 87/05050 (Genex) , EP appli- 
cation no. 87303761 (Genentech), EP 260,105 (Genencor) , WO 
88/06624 (Gist-Brocades NV) , WO 88/07578 (Genentech), WO 

20 88/08028 (Genex), WO 88/08033 (Amgen), WO 88/08164 (Genex), 
Thomas et al. (1985) Nature, 318 375-376; Thomas et al. (1987) 
J. Mol. Biol., 193, 803-813; Russel and Fersht (1987) Nature 
328 496-500. 

25 Generation of site directed mutations 

Prior to mutagenesis the gene encoding the polypeptide of 
interest must be cloned in a suitable vector. Methods for 
generating mutations in specific sites is described below. 

Once the polypeptide encoding gene has been cloned, and 

30 desirable sites for mutation identified and the residue to 
substitute for the original ones have been decided, these 
mutations can be introduced using synthetic oligonucleotides. 
These oligonucleotides contain nucleotide sequences flanking the 
desired mutation sites; mutant nucleotides are inserted during 

35 oligo-nucleotide synthesis. In a preferred method, Site-directed 
mutagenesis is carried out by SOE-PCR mutagenesis technique 
described by Kammann et al. (1989) Nucleic Acids Research 17(13), 
5404, and by Sarkar G. and Soramer, S.S. (1990); Biotechniques 8, 
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404-407, 

Activation of polymers 

If the polymeric molecules to be conjugated with the 
5 polypeptide in question are not active it must be activated by the 
use of a suitable technique. It is also contemplated according to 
the invention to couple the polymeric molecules to the polypeptide 
through a linker. Suitable linkers are well-known to the skilled 
person. 

10 Methods and chemistry for activation of polymeric molecules 
as well as for conjugation of polypeptides are intensively 
described in the literature. Commonly used methods for activation 
of insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glutaraldehyde, biepoxides, 

15 epichlorohydrin, divinylsulf one, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 
immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking" , CRC Press, Boca Raton; G.T. Hermanson et al., 

20 (1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 
polymers but are also applicable to activation of soluble polymers 
e.g. periodate , trichlorotriazine , sulf onylhalides , 

divinylsulfone, carbodiimide etc. The functional groups being 

25 amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 
considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

30 In the following a number of suitable polymer activation 
methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of poly- 
peptides may be performed with the aid of diimide and for example 

35 amino-PEG or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. 
Soc, 98, 289-291) or diazoacetate/ amide (Wong et al., (1992), 
"Chemistry of Protein Conjugation and Crosslinking", CRC Press). 
Coupling polymeric molecules to hydroxy groups are generally 
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very difficult as it must be performed in water. Usually 
hydrolysis predominates over reaction with hydroxy 1 groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
5 disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 
selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

10 Techniques involving coupling electrophilically activated 
PEGs to the amino groups of Lysines may also be useful. Many of 
the usual leaving groups for alcohols give rise to an amine 
linkage. For instance, alkyl sulfonates, such as tresylates 
(Nilsson et al. , (1984), Methods in Enzymology vol. 104, Jacoby, 

15 W. B., Ed., Academic Press: Orlando, p. 56-66; Nilsson et al., 
(1987), Methods in Enzymology vol. 135; Mosbach, K. , Ed.; Academic 
Press: Orlando, pp. 65-79; Scouten et al., (1987), Methods in 
Enzymology vol. 135, Mosbach, K. , Ed., Academic Press: Orlando, 
1987; pp 79-84; Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 

20 93, pp. 4217-4219), mesylates (Harris, (1985), supra; Harris et 
al., (1984), J. Polym. Sci. Polym. Chem. Ed. 22, pp 341-352), aryl 
sulfonates like tosylates, and para-nitrobenzene sulfonates can be 
used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
25 converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with nucleo- 
philes like amino groups in polypeptides allow stable linkages to 
be formed between polymer and polypeptide. In addition to high 
conjugation yields, the reaction conditions are in general mild 
30 (neutral or slightly alkaline pH, to avoid denaturation and little 
or no disruption of activity) , and satisfy the non-destructive re- 
quirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more un- 
stable decomposing into PEG, dioxane, and sulfonic acid (Zalipsky, 
35 (1995), Bioconjugate Chem., 6, 150-165). Epoxides may also been 
used for creating amine bonds but are much less reactive than the 
above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise 
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to carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem., 15, p. 100-114; Monfardini et al., (1995), Biocon- 
5 jugate Chem. , 6, 62-69, with imidazole (Allen et al., (1991), 
Carbohydr. Res., 213, pp 309-319), with para-nitrophenol, DMAP (EP 
632 082 Al, (1993), Looze, Y.) etc. The derivatives are usually 
made by reacting the chlorof ormate with the desired leaving group. 
All these groups give rise to carbamate linkages to the peptide. 

10 Furthermore, isocyanates and isothiocyanates may be employed 

yielding ureas and thioureas, respectively. 

Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5,349,001, (1994), Greenwald et al.). The reactivity of these com- 

15 pounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 
(1992), Zalipsky) . This group may be activated with N-hydroxy suc- 

20 cinimide. 

Furthermore, a special linker can be introduced. The oldest 
being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
25 375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
yields a very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
30 Greenwald, R. B.) thus introducing an additional amide linkage. 

As some peptides do not comprise many Lysines it may be 
advantageous to attach more than one PEG to the same Lysine. This 
can be done e.g. by the use of 1, 3-diamino-2-propanol. 

PEGs may also be attached to the amino-groups of the enzyme 
35 with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine 
residues may also be used as the backbone. 

The coupling technique used in the examples is the N- 
succinimidyl carbonate conjugation technique descried in WO 
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90/13590 (Enzon) . 

Method for preparing improved conjugates 

It is also an object of the invention to provide a method for 
5 preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
10 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
15 selected in step b) at or close to the functional site(s), 

d) coupling polymeric molecules to the mutated polypeptide. 

Step a) Identifying amino acid residues located on the surface of 
the parent polypeptide 

20 

3-dimensional structure ( 3D-structure) 

To perform the method of the invention a 3-dimensional 
structure of the parent polypeptide in question is required. 
This structure may for example be an X-ray structure, an NMR 
25 structure or a model-built structure. The Brookhaven Databank 
is a source of X-ray- and NMR-structures . 

A model-built structure may be produced by the person 
skilled in the art if one or more 3D-structure(s) exist (s) of 
homologous polypeptide (s) sharing at least 30% sequence 
30 identity with the polypeptide in question. Several software 
packages exist which may be employed to construct a model 
structure. One example is the Homology 95.0 package from 
Biosyro. 

Typical actions required for the construction of a model 
35 structure are: alignment of homologous sequences for which 3D- 
structures exist, definition of Structurally Conserved Regions 
(SCRs), assignment of coordinates to SCRs, search for 
structural fragments/ loops in structure databases to replace 
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Variable Regions, assignment of coordinates to these regions, 
and structural refinement by energy minimization. Regions 
containing large inserts (£3 residues) relative to the known 
3D-structures are known to be quite difficult to model, and 
5 structural predictions must be considered with care. 

Having obtained the 3D-structure of the polypeptide in 
question, or a model of the structure based on homology to 
known structures, this structure serves as an essential 
prerequisite for the fulfillment of the method described below. 

10 

Step b) Selection of target amino acid residues f or mutation 
Target amino acid residues to be mutated are according to 
the invention selected in order to obtain additional or fewer 
attachment groups, such as free amino groups (-NH 2 ) or free 
15 carboxylic acid groups (-COOH) , on the surface of the 

polypeptide and/or to obtain a more complete and broadly spread 
shielding of the epitope(s) on the surface of the polypeptide. 

Conservative substitution 
20 It is preferred to make conservative substitutions in the 

polypeptide, as conservative substitutions secure that the 

impact of the mutation on the polypeptide structure is limited. 
In the case of providing additional amino groups this may be 

done by substitution of Arginine to Lysine, both residues being 
25 positively charged, but only the Lysine having a free amino 

group suitable as an attachment groups. 

In the case of providing additional carboxylic acid groups 

the conservative substitution may for instance be an Aspargine 

to Aspartic acid or Glutamine to Glutamic acid substitution. 
3 0 These residues resemble each other in size and shape, except 

from the carboxylic groups being present on the acidic 

residues. 

In the case of providing fewer attachment groups, e.g. at or 
close to the active site, a Lysine may be substituted with a 
35 Arginine, and so on. 

Which amino acids to substitute depends in principle on the 
coupling chemistry to be applied. 
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Non-conservative substitution 

The mutation may also be on target amino acid residues which 
are less/non-conservative. Such mutation is suitable for 
obtaining a more complete and broadly spread shielding of the 
5 polypeptide surface than can be obtained by the conservative 
substitutions . 

The method of the invention is first described in general 
terms, and subsequently using specific examples. 

Note the use of the following terms: 
10 Attachment_residue: residue (s) which can bind polymeric 

molecules, e.g. Lysines (amino group) or Aspartic/Glutamic 
acids (carboxylic groups) . N- or C-terminal amino/ carboxy lie 
groups are to be included where relevant. 

Mutation_residue: residue (s) which is to be mutated, e.g. 
15 Arginine or Aspargine/Glutamine. 

Essential_catalytic_residues: residues which are known to be 
essential for catalytic function, e.g. the catalytic triad in 
Serine proteases. 

Solvent_exposed_residues: These are defined as residues which 
20 are at least 5% exposed according to the BIOSYM/ INSIGHT 

algorithm found in the module Homology 95.0. The sequence of 
commands are as follows: 

Homology==>ProStat=>Access_Surf=>Solv_Radius 1.4; Heavy atoms 
only; Radii source VdW; Output: Fractional Area; Polarity 
25 source: Default. The file f ilename__area. tab is produced. Note: 
For this program to function properly all water molecules must 
first be removed from the structure. 
It looks for example like: 
# PD4 9 8 FINALMODEL 



30 


# residue 


area 




TRP_1 


136.275711 




SER_2 


88.188095 




PR03 


15.458788 




ASN_4 


95.322319 


35 


ASP_5 


4.903404 




PR0_6 


68.096909 




TYR_7 


93.333252 




TYR 8 


31.791576 
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SER_9 95.983139 
» . continued 

1. Identification of residues which are more than 10 A away 

5 from the closest attachmentjresidue, and which are located at 
least 8 A away from essential_catalyticjresidues. This residue 
subset is called REST, and is the primary region for 
conservative mutationjresidue to attachment_residue 
substitutions. 

10 

2. Identification of residues which are located in a 0-5 A 
shell around subset REST, but at least 8 A away from 
essential jsatalytic_residues. This residue subset is called 
SUB5B. This is a secondary region for conservative 

15 mutationjresidue to attachmentjresidue substitutions, as a 

ligand bound to an attachmentjresidue in SUB5B will extend into 
the REST region and potentially prevent epitope recognition. 

3. Identification of solvent_exposed mutationjresidues in REST 
20 and SUB5B as potential mutation sites for introduction of 

attachment jresidues . 

4. Use BIOS YM/ INSIGHT'S Biopolymer module and replace residues 
identified under action 3. 

25 

5. Repeat 1-2 above producing the subset RESTx. This subset 
includes residues which are more than 10 A away from the 
nearest attachmentjresidue, and which are located at least 8 A 
away from essential catalytic residues. 

30 

6. Identify sol vent_exposed jresidues in RESTx. These are 
potential sites for less/non-conservative mutations to 
introduce atttachment jresidues. 

35 

Step c) Substituting, inserting or deleting amino acid residues 

The mutation (s) performed in step c) may be performed by 
standard techniques well known in the art, such as site-directed 
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mutagenesis (see, e.g., Sambrook et al. (1989), Sambrook et al., 
Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. 

A general description of nucleotide substitution can be found 
in e.g. Ford et al., 1991, Protein Expression and Purification 2, 
5 p. 95-107. 

Step d) Coupling polymeric molecules to the mod ified parent enzyme 
Polypeptide-polymer conjugates of the invention may be 
prepared by any coupling method known in the art including the 
10 above mentioned techniques. 

Coupling of polymeric molecules to the polype ptide in question 

If the polymeric molecules to be conjugated with the 
polypeptide are not active it must be activated by the use of a 

15 suitable method. The polymeric molecules may be coupled to the 
polypeptide through a linker. Suitable linkers are well known to 
the skilled person. 

Methods and chemistry for activation of polymeric molecules as 
well as for conjugation of polypeptides are intensively described 

20 in the literature. Commonly used methods for activation of 
insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glutaraldehyde, biepoxides, 
epichlorohydrin, divinylsulfone, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 

25 immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking" , CRC Press, Boca Raton; G.T. Hermanson et al., 
(1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 

3.0 polymers but are also applicable to activation of soluble polymers 
e.g. periodate, trichlorotriazine, sulfonylhalides, 

divinylsulfone, carbodiimide etc. The functional groups being 
amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 

3 5 considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

In the following a number of suitable polymer activation 
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methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of enzymes 
can be performed with the aid of diimide and for example amino-PEG 
5 or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. Soc, 98, 
289-291) or diazoacetate/ amide (Wong et al., (1992), "Chemistry of 
Protein Conjugation and Crosslinking", CRC Press). 

Coupling polymeric molecules to hydroxy groups are generally 
very difficult as it must be performed in water. Usually 

10 hydrolysis predominates over reaction with hydroxyl groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 

15 selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

Techniques involving coupling electrophilically activated PEGs 
to the amino groups of Lysines are also be useful. Many of the 

20 usual leaving groups for alcohols give rise to an amine linkage. 
For instance, alkyl sulfonates, such as tresylates (Nilsson et 
al., (1984), Methods in Enzymology vol. 104, Jacoby, W. B., Ed., 
Academic Press: Orlando, p. 56-66; Nilsson et al., (1987), Methods 
in Enzymology vol. 135; Mosbach, K. , Ed.; Academic Press: Orlando, 

25 pp. 65-79; Scouten et al., (1987), Methods in Enzymology vol. 135, 
Mosbach, K. , Ed., Academic Press: Orlando, 1987; pp 79-84; 
Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 93, pp. 4217-4- 
219), mesylates (Harris, (1985), supra : Harris et al., (1984), J. 
Polym. Sci. Polym. Chem. Ed. 22, pp. 341-352), aryl sulfonates 

30 like tosylates, and para-nitrobenzene sulfonates can be used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with 
nucleophiles like amino groups in polypeptides allow stable 

35 linkages to be formed between polymer and polypeptide. In addition 
to high conjugation yields, the reaction conditions are in general 
mild (neutral or slightly alkaline pH, to avoid denaturation and 
little or no disruption of activity) , and satisfy the non- 
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destructive requirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more 
unstable decomposing into PEG, dioxane, and sulfonic acid 
(Zalipsky, (1995), Bioconjugate Chem., 6, 150-165). Epoxides may 
5 also been used for creating amine bonds but are much less reactive 
than the above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise to 
carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
10 patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem., 15, p. 100-114; Monfardini et al., (1995), 
Bioconjugate Chem. , 6, 62-69, with imidazole (Allen et al., 

(1991) , Carbohydr. Res., 213, pp 309-319), with para-nitrophenol , 
DMAP (EP 632 082 Al, (1993), Looze, Y. ) etc. The derivatives are 

15 usually made by reacting the chloroformate with the desired 
leaving group. All these groups give rise to carbamate linkages to 
the peptide. 

Furthermore, isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 
20 Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5,349,001, (1994), Greenwald et al.). The reactivity of these 
compounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
25 also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 

(1992) , Zalipsky). This group may be activated with N-hydroxy 
succinimide. 

Furthermore, a special linker can be introduced. The oldest 
30 being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
35 yields a very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
Greenwald, R. B.) thus introducing an additional amide linkage. 
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As some peptides do not comprise many Lysines it may be advan- 
tageous to attach more than one PEG to the same Lysine. This can 
be done e.g. by the use of 1, 3-diamino-2-propanol. 

PEGs may also be attached to the amino-groups of the enzyme 
5 with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine 
residues may also be used as the backbone. 

Addition of attachment groups 

Specific examples of PD498 variant-SPEG conjugat es 
10 A specific example of a protease is the parent PD498 (WO 
93/24623 and SEQ ID NO. 2). The parent PD498 has a molecular 
weight of 29 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from the 


Arginine 


Lysine 


active site 






0-5 A 


1 




5-10 A 






10-15 A 


5 


6 


15-20 A 


2 


3 


20-25 A 


1 


3 


total 


9 


12 



15 The inventors examined which parent PD498 sites on the surface 
may be suitable for introducing additional attachment groups. 

A. Suitable conservative Arginine to Lysine substitutions in 
parent PD498 may be any of R51K, R62K, R121K, R169K, R250K, R28K, 
R190K. 

20 B. Suitable non-conservative substitutions in parent PD498 may 
be any of P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, 
N65K, G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K, 
S220K, Y221K, G262K. 

As there is no Lysine residues at or close to the active site 

25 there is no need for removing any attachment group. 

PD498 variant-SPEG conjugates may be prepared using any of the 
above mentioned PD498 variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 

30 described below. 
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Removal of attachment groups 

Specific examples of BPN~ variant-SPEG conjugates 

A specific example of a protease having an attachment group in 
5 the active site is BPflT which has 11 attachment groups (plus an N- 
terminal amino group): BPN" has a molecular weight of 28 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from 


Arginine 


Lysine 


the active site 






0-5 A 




1 


5-10 A 






10-15 A 


1 


4 


15-20 A 


1 


4 


20-25 A 




2 


total 


2 


11 



10 The Lysine residue located within 0-5 A of the active site can 

according to the invention advantageously be removed. Specifically 

this may be done by a K94R substitution. 

BPN" variant-SPEG conjugates may be prepared using the above 

mentioned BPN" variant as the starting material by any conjugation 
15 technique known in the art for coupling polymeric molecules to 

amino groups on the enzyme. 

Addition and removal of attachment groups 

Specific example of Savinase®-SPEG -conjugates 
20 As described in Example 2 parent Savinase® (von der Osten et 

al., (1993), Journal of Biotechnology, 28, p. 55+ and SEQ ID NO. 

3) may according to the invention have added a number of amino 

attachment groups to the surface and removed an amino attachment 

group close to the active site. 
25 Any of the following substitutions in the parent Savinase® 

are sites for mutagenesis: R10K, R19K, R45K, R145K, R170K, 

R186K and R247K. 

The substitution K94R are identified as a mutation suitable 

for preventing attachment of polymers close to active site. 
30 Savinase® variant-SPEG conjugates may be prepared using any of 



L 
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the above mentioned Savinase® variants as the starting material by 
any conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. 

5 Addition of attachment groups 

A specific examples of Humicola lanuginosa lipase va riants-SPEG 
conjugates 

Specific examples of lipase variants with reduced 
immunogenicity using the parent Huminocal lanuginosa DSM 4109 
10 lipase (see SEQ ID No 6) as the backbone for substitutions are 
listed below. 

The parent unmodified Humicola lanuginosa lipase has 8 
attachment groups including the N-terminal NH 2 group and a 
molecular weight of about 29 kDa. 
15 A. Suitable conservative Arginine to Lysine substitutions in the 
parent lipase may be any of R133K, R139K, R160K, R179K, R209K, 
R118K and R125K. 

Suitable non-conservative substitutions in the parent lipase 
may be any of: 

20 A18K / G31K # T32K / N33K / G38K f A40K / D48K / T50K,E56K,D57K f S58K,G59K, 
V60K,G61K / D62K f T64K # L78K / N88K / G91K / N92K / L93K,S105K,G106K / 
V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 

Further suitable non-conservative substitution in the Humicola 
lanuginosa lipase include: E87K or D254K. 

25 Lipase variant-SPEG conjugates may be prepared using any of the 
above mentioned lipase variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 
described below. 

30 In Example 12 below is it shown that a conjugate of the 
Humicola lanuginosa lipase variant with a E87K+D254K substitutions 
coupled to S-PEG 15,000 has reduced immunogenic response in Balb/C 
mice in comparison to the corresponding parent unmodified enzyme. 

35 Immunogenicity and Alleraenicitv 

"Immunogenicity" is a wider term than "antigenicity" and 
"allergenicity", and expresses the immune system's response to the 
presence of foreign substances. Said foreign substances are called 
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immunogens , antigens and allergens depending of the type of immune 
response the elicit. 

An "immunogen" may be defined as a substance which, when intro- 
duced into circulatory system of animals and humans, is capable of 
5 stimulating an immunologic response resulting in formation of 
immunoglobulin. 

The term "antigen" refers to substances which by themselves are 
capable of generating antibodies when recognized as a non-self 
molecule. 

10 Further, an "allergen" may be defined as an antigen which may 
give rise to allergic sensitization or an allergic response by IgE 
antibodies (in humans, and molecules with comparable effects in 
animals) . 

15 Assessment of immunoaencitv 

Assessment of the immunogenicity may be made by injecting 
animal subcutaneous ly to enter the immunogen into the circulation 
system and comparing the response with the response of the 
corresponding parent polypeptide. 

20 The "circulatory system" of the body of humans and animals 
means, in the context of the present invention, the system which 
mainly consists of the heart and blood vessels. The heart delivers 
the necessary energy for maintaining blood circulation in the 
vascular system. The circulation system functions as the 

25 organism's transportation system, when the blood transports 0 2 , 
nutritious matter, hormones, and other substances of importance 
for the cell regulation into the tissue. Further the blood removes 
C0 2 from the tissue to the lungs and residual substances to e.g. 
the kidneys. Furthermore, the blood is of importance for the 

3 0 temperature regulation and the defence mechanisms of the body, 
which include the immune system. 

A number of in vitro animal models exist for assessment of the 
immunogenic potential of polypeptides. Some of these models give a 
suitable basis for hazard assessment in man. Suitable models 

35 include a mice model. 

This model seek to identify the immunogenic response in the 
form of the IgG response in Balb/C mice being injected 
subcutaneous ly with modified and unmodified polypeptides. 
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Also other animal models can be used for assessment of the 
immunogenic potential. 

A polypeptide having "reduced immunogenicity" according to the 
invention indicates that the amount of produced antibodies, e.g. 
5 immunoglobulin in humans, and molecules with comparable effects in 
specific animals, which can lead to an immune response, is 
significantly decreased, when introduced into the circulatory 
system, in comparison to the corresponding parent polypeptide. 

For Balb/C mice the IgG response gives a good indication of the 
10 immunigenic potential of polypeptides. 

Assessment of alleraenicitv 

Assessment of allergenicity may be made by inhalation tests, 
comparing the effect of intratracheally (into the trachea) 

15 administrated parent enzymes with the corresponding modified 
enzymes according to the invention. 

A number of in vivo animal models exist for assessment of the 
allegenicity of enzymes. Some of these models give a suitable 
basis for hazard assessment in man. Suitable models include a 

20 guinea pig model and a mouse model. These models seek to identify 
respiratory allergens as a function of elicitation reactions 
induced in previously sensitised animals. According to these 
models the alleged allergens are introduced intratracheally into 
the animals. 

25 A suitable strain of guinea pigs, the Dunkin Hartley strain, do 
not as humans, produce IgE antibodies in connection with the 
allergic response. However, they produce another type of antibody 
the IgGIA and IgGIB (see e.g. Prent0, ATLA, 19, p. 8-14, 1991), 
which are responsible for their allergenic response to inhaled 

30 polypeptides including enzymes. Therefore, when using the Dunkin 
Hartley animal model, the relative amount of IgGIA and IgGIB is a 
measure of the allergenicity level. 

The Balb/C mice strain is suitable for intratracheal exposure. 
Balb/C mice produce IgE as the allergic response. 

35 More details on assessing respiratory allergens in guinea pigs 
and mice is described by Kimber et al.,(1996), Fundamental and 
Applied Toxicology, 33, p. 1-10. 
Other animals such as rats, rabbits etc. may also be used for 
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comparable studies. 
Composition 

The invention relates to a composition comprising a 
5 polypeptide-polymer conjugate of the invention. 

The composition may be a pharmaceutical or industrial 
composition. 

The composition may further comprise other polypeptides, 
proteins or enzymes and/or ingredients normally used in e.g. 

10 detergents, including soap bars, household articles, 
agrochemicals, personal care products, including skin care 
compositions, cleaning compositions for e.g. contact lenses, oral 
and dermal pharmaceuticals, composition use for treating textiles, 
compositions used for manufacturing food, e.g. baking, and feed 

15 etc. 

Use of the polypeptide-polymer conjugate 

The invention also relates to the use of the method of the 
invention for reducing the immune response of polypeptides. 
20 It is also an object of the invention to use the polypeptide- 

polymer conjugate of the invention to reduce the allergenicity of 
industrial products, such as detergents, such as laundry, disk 
wash and hard surface cleaning detergents, and food or feed 
products . 

25 

MATERIAL AND METHODS 
Materials 

Enzymes: 

PD498: Protease of subtilisin type shown in WO 93/24623. The 
3 0 sequence of PD498 is shown in SEQ ID NO. 1 and 2. 
Savinase® (Available from Novo Nordisk A/S) 

Humicola lanuginosa lipase: Available from Novo Nordisk as 
lipolase® and is further described in EP 305,216. The DNA and 
protein sequence is shown in SEQ ID NO 5 and 6, respectively. 
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Strains: 

B. subtilis 3 09 and 147 are variants of Bacillus lentus, 
deposited with the NCIB and accorded the accession numbers NCIB 
5 10309 and 10147, and described in US Patent No. 3,723,250 
incorporated by reference herein. 

E. coli MC 1000 (M.J. Casadaban and S.N. Cohen (1980); J. 
Mol. Biol. 138 179-207), was made r~,m+ by conventional methods 
and is also described in US Patent Application Serial No. 
10 039,298. 

Vectors : 

pPD498: E. coli - B . subtilis shuttle vector (described in 
US patent No. 5,621,089 under section 6.2.1.6) containing the 
15 wild-type gene encoding for PD498 protease (SEQ ID NO. 2). The 
same vector is use for mutagenesis in E. coli as well as for 
expression in B . subtilis. 

General molecular biology methods: 

20 Unless otherwise mentioned the DNA manipulations and 
transformations were performed using standard methods of 
molecular biology (Sambrook et al. (1989) Molecular cloning: A 
laboratory manual, Cold Spring Harbor lab. , Cold Spring Harbor, 
NY; Ausubel, F. M. et al. (eds.) "Current protocols in 

25 Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R. , 
and Cutting, S. M. (eds.) "Molecular Biological Methods for 
Bacillus". John Wiley and Sons, 19-90). 

Enzymes for DNA manipulations were used according to the 
specifications of the suppliers. 

30 

Materials, chemicals and solutions: 

Horse Radish Peroxidase labeled anti-rat-Ig (Dako, DK, P162, # 
031; dilution 1:1000). 
35 Mouse anti-rat IgE (Serotec MCA193; dilution 1:200). 
Rat anti-mouse IgE (Serotec MCA419; dilution 1:100). 
Biotin-labeled mouse anti-rat IgGl monoclonal antibody (Zymed 03- 
9140; dilution 1:1000) 
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Biot in- labeled rat anti-mouse IgGl monoclonal antibody (Serotec 
MCA336B; dilution 1:1000) 

Streptavidin-horse radish peroxidase (Kirkeg&rd & Perry 14-30-00; 
dilution 1:1000). 
5 CovaLink NH 2 plates (Nunc, Cat# 459439) 
• Cyanuric chloride (Aldrich) 
Acetone (Merck) 

Rat anti-Mouse IgGl, biotin (SeroTec, Cat# MCA336B) 
Streptavidin, peroxidase (KPL) 
10 Ortho-Phenylene-diamine (OPD) (Kem-en-Tec) 
H 2 0 2 , 30% (Merck) 
Tween 20 (Merck) 
Skim Milk powder (Difco) 
H 2 S0 4 (Merck) 

15 

Buffers and Solutions: 

Carbonate buffer (0.1 M, pH 10 (1 liter)) Na 2 C0 3 10.60 g 

PBS (pH 7.2 (1 liter)) NaCl 8.00 g 

KC1 0.20 g 

20 K 2 HP0 4 1.04 g 

KH 2 P0 4 0.32 g 

Washing buffer PBS, 0.05% (v/v) Tween 20 
Blocking buffer PBS, 2% (wt/v) Skim Milk powder 

Dilution buffer PBS, 0.05% (v/v) Tween 20, 0.5% (wt/v) Skim Milk 
25 powder 

Citrate buffer (0.1M, pH 5.0-5.2 (1 liter) )NaCitrate 20.60 g 

Citric acid 6.30 g 

Activation of CovaLink plates: 

■ Make a fresh stock solution of 10 mg cyanuric chloride per ml 
30 acetone. 

• Just before use, dilute the cyanuric chloride stock solution 
into PBS, while stirring, to a final concentration of lmg/ml. 

• Add 100 ml of the dilution to each well of the CovaLink NH2 
plates, and incubate for 5 minutes at room temperature. 

35 ■ Wash 3 times with PBS. 

• Dry the freshly prepared activated plates at 50 °C for 30 
minutes . 

• Immediately seal each plate with sealing tape. 
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Preactivated plates can be stored at room temperature for 3 
weeks when kept in a plastic bag. 

Sodium Borate, borax (Sigma) 
5 3,3-Dimethyl glutaric acid (Sigma) 
CaCl 2 (Sigma) 

Tresyl chloride (2,2, 2-trif louroethansulfonyl chloride) (Fluka) 
l-ethyl-3- (3-dimethylaminopropyl) carbodiimide (EDC) (Fluka) 
tf-Hydroxy succinimide (Fluka art. 56480) ) 
10 Phosgene (Fluka art. 79380) 
Lactose (Merck 7656) 

PMSF (phenyl methyl sulfonyl flouride) from Sigma 
Succinyl-Alanine-Alanine-Proline-Phenylalanine-para-nitroanilide 
(Suc-AAPF-pNP) Sigma no. S-7388, Mw 624.6 g/mole. 

15 

Colouring substrate: 

OPD: o-phenylene-diamine, (Kementec cat no. 4260) 
Test Animals: 

20 Dunkin Hartley guinea pigs (from Charles River, DE) 

Female Balb/C mice (about 20 grams) purchased from Bomholdtgaard, 
Ry, Denmark. 

Equipment : 
25 XCEL II (Novex) 

ELISA reader (UVmax, Molecular Devices) 
HPLC (Waters) 
PFLC (Pharmacia) 

Superdex-75 column, Mono-Q, Mono S from Pharmacia, SW. 
30 SLT: Fotometer from SLT Lablnstruments 

Size-exclusion chromatograph (Spherogel TSK-G2000 SW) . 
Size-exclusion chromatograph (Superdex 200, Pharmacia, SW) 
Amicon Cell 

35 Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA 
manipulations, such as e.g. restriction endonucleases, ligases 
etc., are obtained from New England Biolabs. Inc. 
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Methods 

ELISA procedure for determination of IgG^ positive guinea pigs 

ELISA microtiter plates are coated with rabbit anti-PD498 
5 1:8000 in carbonate buffer and incubated over night at 4°C. The 
next day the plates is blocked with 2% BSA for 1 hour and washes 3 
times with PBS Tween 20. 

1 Jig/ml PD498 is added to the plates and incubated for 1 hour, 
then washed 3 times with PBS Tween 20. 
10 All guinea pig sera samples and controls are applied to the 

ELISA plates with 2 |il sera and 98 ^1 PBS, incubated for 1 hour 
and washed 3 times with PBS Tween 20. 

Then goat anti-guinea pig IgGi (1:4000 in PBS buffer (Nordic 
Immunology 44-682)) is applied to the plates, incubated for 1 hour 
15 and washed with PBS tween 20. 

Alkaline phosphatase marked rabbit anti-goat 1:8000 (Sigma 
A4187) is applied and incubated for 1 hour, washed 2 times in PBS 
Tween20 and 1 time with diethanol amine buffer. 

The marked alkaline phosphatase is developed using p- 
20 nitrophenyl phosphate for 30 minutes at 37°C or until appropriate 
colour has developed. 

The reaction is stopped using Stop medium (K 2 HP0 4 /HaH 3 buffer 
comprising EDTA (pH 10)) and read at OD 405/650 using a ELISA 
reader . 

25 Double blinds are included on all ELISA plates. 

Positive and negative sera values are calculated as the 
average blind values added 2 times the standard deviation. This 
gives an accuracy of 95%. 

30 Determinat ion of the molecule weight 

Electrophoretic separation of proteins was performed by standard 
methods using 4-20% gradient SDS poly acrylamide gels (Novex) . 
Proteins were detected by silver staining. The molecule weight was 
measured relative to the mobility of Mark-12® wide range molecule 

35 weight standards from Novex. 



Protease activity 
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Analysis with Suc-Ala-Ala-Pro-Phe-pNa: 

Proteases cleave the bond between the peptide and p- 
nitroaniline to give a visible yellow colour absorbing at 405 nm. 

Buffer: e.g. Britton and Robinson buffer pH 8.3 
5 Substrate: 100 mg suc-AAPF-pNa is dissolved into 1 ml dimethyl 
sulfoxide (DMSO) . 100 |il of this is diluted into 10 ml with 
Britton and Robinson buffer. 

The substrate and protease solution is mixed and the 
absorbance is monitored at 405 nm as a function of time and ABS405 
10 nm/roin. The temperature should be controlled (20-50°C depending on 
protease) . This is a measure of the protease activity in the 
sample. 

Proteolytic Activity 

15 In the context of this invention proteolytic activity is 

expressed in Kilo NOVO Protease Units (KNPU) . The activity is 
determined relatively to an enzyme standard (SAVINASE_) , and 
the determination is based on the digestion of a dimethyl 
casein (DMC) solution by the proteolytic enzyme at standard 

20 conditions, i.e. 50 °C, pH 8.3, 9 min. reaction time, 3 min. 
measuring time. A folder AF 220/1 is available upon request to 
Novo Nordisk A/S, Denmark, which folder is hereby included by 
reference. 

A GU is a Glycine Unit, defined as the proteolytic enzyme 
25 activity which, under standard conditions, during a 15-minutes* 
incubation at 40°C, with N-acetyl casein as substrate, produces 
an amount of NH2~group equivalent to 1 mmole of glycine. 

Enzyme activity can also be measured using the PNA assay, 
according to reaction with the soluble substrate succinyl- 
3 0 alanine-alanine-proline-pheny 1-alanine-para-nitrophenol , which 
is described in the Journal of American Oil Chemists Society, 
Rothgeb, T.M., Goodlander, B.D., Garrison, P.H., and Smith, 
L.A. , (1988) . 

35 Fermentation of PD498 variants 

Fermentation of PD498 variants in B . subtilis are performed 
at 30°C on a rotary shaking table (300 r.p.m.) in 500 ml baffled 
Erlenmeyer flasks containing 100 ml BPX medium for 5 days. In 
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order to make an e.g. 2 liter broth 20 Erlenmeyer flasks are 
fermented simultaneously. 

Media: 

5 BPX; Composition (per liter) 



Sodium caseinate lOg 

The starch in the medium is liquefied with a-amylase and 
the medium is sterilized by heating at 120°C for 45 minutes. 
After sterilization the pH of the medium is adjusted to 9 by 
15 addition of NaHC0 3 to 0.1 M. 

Purification of PD498 variants 

Approximately 1.6 litres of PD498 variant fermentation 
broth are centrifuged at 5000 rpm for 35 minutes in 1 litre 

2 0 beakers. The supernatants are adjusted to pH 7.0 using 10% 
acetic acid and filtered on Seitz Supra S100 filter plates. 
The filtrates are concentrated to approximately 400 ml using an 
Amicon CH2A UF unit equipped with an Amicon S1Y10 UF cartridge. 
The UF concentrate is centrifuged and filtered prior to 

25 absorption at room temperature on a Bacitracin affinity column 
at pH 7. The PD498 variant is eluted from the Bacitracin column 
at room temperature using 25% 2-propanol and 1 M sodium 
chloride in a buffer solution with 0.01 dime-thyl-glutaric 
acid, 0.1 M boric acid and 0.002 M calcium chloride adjusted to 

30 pH 7. 

The fractions with protease activity from the Bacitracin 
purification step are combined and applied to a 750 ml Sephadex 
G25 column (5 cm diameter) equilibrated with a buffer 
containing 0.01 dimethylglutaric acid, 0.1 M boric acid and 
35 0.002 M calcium chloride adjusted to pH 6.0. 

Fractions with proteolytic activity from the Sephadex G25 
column are combined and applied to a 150 ml CM Sepharose CL 6B 
cat-ion exchange column (5 cm diameter) equilibrated with a 



Potato starch 



lOOg 



10 



Ground barley 
Soybean flour 
Na 2 HP0 4 X 12 H 2 0 
Pluronic 



50g 
20g 
9g 

O.lg 
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buffer containing 0.01 M dimethylglutaric acid, 0.1 M boric 
acid, and 0.002 M calcium chloride adjusted to pH 6.0. 
The protease is eluted using a linear gradient of 0-0.5 M 
sodium chloride in 1 litres of the same buffer. 
5 Protease containing fractions from the CM Sepharose column are 
combined and filtered through a 2ji filter. 

Balb/C mice IaG ELISA Procedure: 

• The antigen is diluted to 1 mg/ml in carbonate buffer. 
10 • 100 ml is added to each well. 

• The plates are coated overnight at 4°C. 

• Unspecif ic adsorption is blocked by incubating each well for 1 
hour at room temperature with 200 ml blocking buffer. 

• The plates are washed 3x with 300 ml washing buffer. 

15 ■ Unknown mouse sera are diluted in dilution buffer, typically 
lOx, 2 Ox and 4 Ox, or higher. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with washing buffer. 
20 • The anti-Mouse IgGl antibody is diluted 2000x in dilution 

buffer. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with washing buffer. 
25 • Streptavidine is diluted 1000X in dilution buffer. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with 300 ml washing 
buffer. 

30 • OPD (0.6 mg/ml) and H 2 0 2 (0.4 ml/ml) is dissolved in citrate 
buffer. 

• 100 ml is added to each well. 

• Incubation is for 10 minutes at room temperature. 

• The reaction is stopped by adding 100 ml H 2 S0 4 . 

35 ■ The plates are read at 492 nm with 620 nm as reference. 

Immunisation of mice 

Balb/C mice (20 grams) are immunised 10 times (intervals of 14 
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days) by subcutaneous injection of the modified or unmodified 
polypeptide in question, respectively by standard proceedures 
known in art. 



5 EXAMPLES 
Example 1 

Suitable substitutions in PD498 for addi tion of amino 
10 attachment groups f-NHo ) 

The 3D structure of parent PD498 was modeled as described 
above based on 59% sequence identity with Thermitase® 
(2tec.pdb) . 

The sequence of PD498 is (see SEQ ID NO. 2) . PD498 residue 
15 numbering is used, 1-280. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2 .bcl below: 



Conservative substitutions: 

20 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 55,0,255 

3 Zone Subset LYS :lys:NZ Static monomer/residue 10 
Color_Subset 255,255,0 

25 4 Zone Subset NTERM :1:N Static monomer/residue 10 
Color_Subset 255,255,0 

5 #N0TE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 39, 72, 226 Static monomer/residue 8 
30 Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object name according to the protein 

10 Combine Subset REST Difference PD498FINALMODEL ALLZONE 
35 11 List Subset REST Atom Output File restatom. list 

12 List Subset REST monomer /residue Output_File restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 

14 List Subset ACTSITE Atom Output_File actsiteatom. list 

15 List Subset ACTSITE monomer /residue Output_File 
40 actsitemole. list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
ColorJSubset 

18 Combine Subset SUB 5 A Difference REST5A ACTSITE 
45 19 Combine Subset SUB5B Difference SUB5A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

21 List Subset SUB5B Atom Output File subSbatom. list 

22 List Subset SUB5B monomer/ residue Output_File sub5bmole. list 
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23 #Now identify sites for lys->arg substitutions and continue 
with makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list, 
subSbatom. list & accsiteatom. list 

5 

Comments : 

Lines 1-8: The subset ALLZONE is defined as those residues 
which are either within 10 A of the free amino groups on 
lysines or the N-terminal, or within 8 A of the catalytic triad 
10 residues 39, 72 and 226. 

Line 10: The subset REST is defined as those residues not 
included in ALLZONE. 

Lines 17-20: Subset SUB5B is defined as those residues in a 
5 A shell around REST, excluding residues within 8 A of the 
15 catalytic residues. 

Line 23-24: REST contains Arg62 and Argl69, SUB5B contains 
Arg51, Argl21, and Arg250. ACTSITE contains Argl03, but 
position 103 is within 8 A from essential_catalytic_residues, 
and thus not relevant. 
20 The colour codes are: (255,0,255) = magenta, 

(255,255, 0)yellow, (255,0,0) red, and (255, 255, 255)= white. 

The substitutions R51K, R62K, R121K, R169K and R250K are 
identified in parent PD498 as suitable sites for mutagenesis. 
The residues are substituted below in section 2, and further 
25 analysis done: 



Non-conservative substitutions: 
makeKzone2 .bcl 

I #sourcefile makezone2.bcl Claus von der Osten 961128 
30 2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #N0TE: editnextline object name according to protein 

5 Copy Object -To_Clipboard -Displace PD498FINALMODEL 
35 newmodel 

6 Biopolymer 

7 #NOTE: editnextline object name according to protein 

8 Blank Object On PD4 9 8 FINALMODEL 

9 #N0TE: editnextlines with lys->arg positions 
40 10 Replace Residue newmodel: 51 lys L 

II Replace Residue newmodel: 62 lys L 

12 Replace Residue newmodel: 121 lys L 

13 Replace Residue newmodel: 169 lys L 

14 Replace Residue newmodel: 250 lys L 
45 15 # 
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16 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

17 Color Molecule Atoms newmodel Specified Specification 
255,0,255 

5 18 Zone Subset LYSx newmodel: lys:NZ Static monomer/ residue 10 
Color_Subset 255,255,0 

19 Zone Subset NTERMx newmodel: 1:N Static monomer/ residue 10 
Color_Subset 255,255,0 

20 #NOTE: editnextline ACTSITEx residues according to the 
10 protein 

21 Zone Subset ACTSITEx newmodel: 39 , 72 , 226 Static 
monomer/ residue 8 Color_Subset 255,255,0 

22 Combine Subset ALLZONEx Union LYSx NTERMx 

23 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 24 Combine Subset RESTx Difference newmodel ALLZONEx 

25 List Subset RESTx Atom Output^File restxatom. list 

26 List Subset RESTx monomer /residue Output_File 
restxmole. list 

27 # 

20 28 Color Molecule Atoms ACTSITEx Specified Specification 
255 ,0,0 

29 List Subset ACTSITEx Atom Output File acts itexatom. list 

30 List Subset ACTSITEx monomer /residue Output_File 
actsitexmole . list 

25 31 # . 

32 #read restxa torn. list or restxmole, list to identify sites 
for (not_arg) ->lys subst. if needed 

Comments : 

30 Lines 1-15: Solvent exposed arginines in subsets REST and 

SUB5B are replaced by lysines. Solvent accessibilities are 

recalculated following arginine replacement. 

Lines 16-23: The subset ALLZONEx is defined as those 

residues which are either within 10 A of the free amino groups 
3 5 on Lysines (after replacement) or the N-terminal, or within 8 A 

of the catalytic triad residues 39, 72 and 226, 

Line 24-26: The subset RESTx is defined as those residues 

not included in ALLZONEx, i.e. residues which are still 

potential epitope contributors. Of the residues in RESTx, the 
40 following are >5% exposed (see lists below): 6-7,9-12,43- 

45,65,87-88,209,211,216-221,262. 

The following mutations are proposed in parent PD498: P6K, 

Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, G87K, I88K, 

N209K, A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K. 
45 Relevant data for Example 1: 

Solvent accessibility data for PD498MODEL: 

# PD498MODEL Fri Nov 29 10:24:48 MET 1996 

# residue area 
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TRP_1 
SER_2 
PRO_3 
ASN_4 
5 ASP_5 
PRO_6 
TYR_7 
TYR_8 
SER_9 

10 ALA_10 
TYR_11 
GLN_12 
TYR_13 
GLY_14 

15 PRO_15 
GLN_16 
ASN_17 
THR_18 
SER_19 

20 THR_20 
PRO_21 
ALA_22 
ALA_23 
TRP_24 

25 ASP_25 
VAL_26 
THR_27 
ARG_28 
GLYJ29 

30 SER_30 
SER_31 
THR_32 
GLN_33 
THR_34 

35 VAL_35 
ALAJ36 
VAL_37 
LEU_38 
ASP_39 

40 SER_40 
GLY_41 
VAL_42 
ASP_43 
TYR_44 

45 ASN_45 
HIS_46 
PRO_47 
ASP_48 
LEU_49 

50 ALA_50 
ARG_51 
LYS_52 
VAL_53 
ILE_54 

55 LYS_55 
GLY_56 
TYR 57 



136.275711 

88.188095 

15.458788 

95.322319 

4.903404 

68.096909 

93.333252 

31.791576 

95.983139 

77.983536 

150.704727 

26.983349 

44.328232 

3.200084 

2.149547 

61.385445 

37.776707 

1.237873 

41.031750 

4.321402 

16.658991 

42.107288 

0.000000 

3.713619 

82.645493 

74.397812 

14.950654 

110.606209 

0.242063 

57.225292 

86.986198 

1.928865 

42.008949 

0.502189 

0.268693 

0.000000 

5.255383 

1.550332 

3.585718 

2.475746 

4.329043 

1.704864 

25.889742 

89.194855 

109.981819 

0.268693 

66.580925 

0.000000 

0.770882 

49.618046 

218.751709 

18.808538 

39.937984 

98.478104 

103.612228 

17.199390 

67.719147 
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ASP_58 
PHE_59 
ILE_60 
ASP_61 
5 ARG_62 
ASP_63 
ASN_64 
ASN_65 
PRO_66 

10 MET_67 
ASP_68 
LEU_69 
ASNJ70 
GLYJ71 

15 HIS_72 
GLYJ73 
THRJ74 
HISJ75 
VALJ76 

20 ALAJ77 
GLYJ78 
THRJ79 
VAL_80 
ALA_81 

25 ALA_82 
ASP_83 
THR_84 
ASN_85 
ASN_86 

30 GLY_87 
ILE_88 
GLY_89 
VAL_90 
ALA_91 

35 GLY_92 
MET_93 
ALA_94 
PRO_95 
ASP_96 

40 THR_97 
LYS_98 
ILE_99 
LEU_100 
ALA_101 

45 VAL_102 
ARGJL03 
VAL_104 
LEU_105 
ASP_106 

50 ALA_107 
ASN_108 
GLY_109 
SERJL10 
GLY_111 

55 SER_112 
LEU_113 
ASP 114 



0.000000 

40.291119 

50.151962 

70.078888 

166.777557 

35.892376 

120.641953 

64.982895 

6.986028 

58.504269 

28.668840 

104.467468 

78.460953 

5.615932 

43.158905 

0.268693 

0.000000 

0.484127 

1.880854 

0.000000 

0.933982 

9.589676 

0.000000 

0.000000 

0,000000 

46.244987 

27.783333 

75.924225 

44.813908 

50.453152 

74.428070 

4.115077 

6.717335 

2.872341 

0.233495 

5.876057 

0.000000 

17.682203 

83.431740 

1.506567 

72.674973 

4.251006 

6.717335 

0.806080 

1.426676 

2.662697 

2.171855 

18.808538 

52.167435 

52.905663 

115.871315 

30.943356 

57.933651 

50.705326 

56.383320 

71.312195 

110.410919 
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SER 


115 


13.910152 




ILE~ 


116 


22.570246 




ALA" 


117 


5.642561 




SER 


118 


29.313131 


5 


GLY 


119 


0.000000 




ILE 


120 


1.343467 




ARG 


121 


118.391129 




tyr" 


122 


44.203033 




ALA 


123 


0.000000 


10 


ALA 


124 


7.974043 




ASP 


125 


83.851639 




GLN 


126 


64.311974 




GLY 127 


36.812618 




ALA 128 


4.705107 


15 


LYS 


129 


90.886139 




VAL 


"130 


1.039576 




LEU" 


"131 


2.149547 




asn" 


'132 


4.315227 




LEU" 


"133 


1.880854 


20 


SER" 


"134 


3.563334 




LEU 


"135 


26.371397 




GLY" 


"136 


59.151070 




CYS~ 


"137 


63.333755 




GLU" 


"138 


111.553314 


25 


CYS" 


"139 


83.591461 




ASN" 


"140 


80.757843 




ser" 


"141 


25.899158 




THR" 


"142 


99.889725 




thr" 


"14 3 


73.323814 


30 


leu" 


"144 


5.589301 




LYS" 


"145 


94.708755 




SER" 


"146 


72.636993 




ALA" 


"147 


9.235920 




VAL" 


"148 


1.612160 


35 


asp" 


'149 


57.431465 




tyr" 


"150 


106.352493 




ALA" 


"151 


0.268693 




TRP" 


"152 


43.133667 




ASN" 


"153 


112.864975 


40 


LYS" 


"154 


110.009468 




GLY" 


"155 


33.352180 




ALA" 


"156 


3 .493014 




VAL 157 


1.048144 




VAL 158 


2.043953 


45 


VAL 


159 


0.000000 




ALA" 


160 


0.537387 




ALA" 


"161 


10.872165 




ALA" 


"162 


7.823834 




GLY" 


"163 


12.064573 


50 


asn" 


"164 


81.183388 




asp" 


"165 


64.495300 




asn" 


"166 


83.457443 




VAL" 


167 


68.516815 




SER" 


"168 


78.799652 


55 


arg" 


"169 


116.937134 




thr' 


"170 


57.275074 




phe" 


"171 


51.416462 
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GLN 


172 


18.934589 




PRO 


173 


1,880854 




ALA_ 


174 


6.522357 




SER 


175 


26.184139 


5 


TYR 


176 


21.425076 




PRO 


177 


85.613541 




ASN 


178 


34.700817 




ALA 


179 


0.268693 




ILE~ 


180 


1.074774 


10 


ALA~ 


181 


3.761708 




VAL 


182 


0.000000 




GLY 


183 


2.149547 




ALA 


184 


0.951118 




ILE 


185 


0.806080 


15 


ASP 


186 


30.022263 




SER 


187 


72.518509 




ASN 


188 


117.128021 




ASP 


189 


47.601345 




arc" 


"190 


150.050873 


20 


LYS" 


191 


64.822807 




ALA 


"l92 


2.686934 




SER 


"193 


96.223808 




PHE 


'194 


51.482613 




SER 


195 


1.400973 


25 


ASN 


"196 


4.148808 




TYR 


"197 


80.937309 




GLY" 


"198 


10.747736 




THR 


"199 


93.221252 




TRP" 


"200 


169.943604 


30 


VAL" 


"201 


15.280325 




ASP 202 


12.141763 




VAL 


203 


0.268693 




thr" 


"204 


3.409728 




ALA" 


"205 


0.000000 


35 


PRO" 


"206 


0.000000 




gly" 


"207 


0.000000 




val" 


"208 


37.137192 




ASN" 


"209 


78.286270 




ILE" 


"210 


9.404268 


40 


ALA** 


"211 


25.938599 




SER" 


"212 


5.037172 




thr" 


"213 


0.000000 




val" 


"214 


22.301552 




pro" 


"215 


45.251030 


45 


ASN" 


"216 


131.014160 




ASN" 


"217 


88.383461 




gly" 


"218 


21.226780 




TYR" 


"219 


88.907570 




SER" 


"220 


39.966541 


50 


TYR" 


"221 


166.037018 




met" 


"222 


50.951096 




SER" 


"223 


54.435001 




gly" 


"224 


1.880854 




thr" 


"225 


1.634468 


55 


SER" 


"226 


17.432346 




MET 


"227 


7.233279 




ALA" 


"228 


0.000000 
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SER_229 
PRO_23 0 
HIS_231 
VAL_232 
5 ALA_233 
GLY_234 
LEU_235 
ALA_236 
ALA_237 

10 LEU_238 
LEU_239 
ALA_240 
SER_241 
GLN_242 

15 GLY_243 
LYS_244 
ASN_245 
ASN_246 
VAL_247 

20 GLN_248 
ILE_249 
ARG_250 
GLN_251 
ALA_252 

25 ILE_253 
GLU_254 
GLN_255 
THR_256 
ALAJ257 

30 ASP_258 
LYS_259 
ILE_260 
SER_261 
GLY_262 

35 THR_263 
GLY_264 
THR_265 
ASN_266 
PHE_267 

40 LYS_268 
TYR_269 
GLY_270 
LYS_271 
ILE_272 

45 ASN_273 
SER_274 
ASN_275 
LYS_276 
ALA_277 

50 VAL_278 
ARG_279 
TYR_280 
CA_281 
CA_282 

55 CA 283 



0.000000 
0.268693 
2.680759 
0.000000 
0.000000 

I. 074774 

II. 500556 
0.000000 
0.000000 

I. 612160 
0.000000 
10.648088 
39.138004 
71.056175 
66.487144 
43.256012 
80.728127 
34.859673 
84.145645 
51.819775 
8.598188 
35.055809 
71.928093 
0.000000 
4.845899 
13.344438 
81.705254 
9.836061 
2.810513 
44.656136 
113.071686 
32.089527 
91.590103 
26.450439 
38.308762 
46.870056 
88.551804 
34.698349 
7.756911 
103.212852 
37.638382 
0.000000 

II. 376978 
2.885231 
19.195255 
2.651736 
38.177547 
84.549576 
1.074774 
4.775503 
162.693054 
96.572929 
0.000000 
0.000000 
8.803203 



Subset REST: 



L 
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restmole. list 
Sxibsot REST" 

PD4 9 8 FINALMODEL : 6-7 , 9-12 , 43-46 , 61-63 , 65 , 87- 
89 , 111-114 , 117-118 , 131 , 
5 PD4 9 8 FINALMODEL: 137-139 , 158-159 , 169-171 f 173- 
174,180-181,209,211, 

PD4 9 8 FINALMODEL: 216-221, 232-233, 262, E282H 

restatom. list 
Subset REST: 
10 PD4 9 8 FINALMODEL : PRO 6 :N, CA, CD, C,0, CB, CG 

PD4 9 8 FINALMODEL : TYR 7 : N , CA, C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
PD4 9 8 FINALMODEL : SER 9 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL: ALA 10 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL: TYR 11 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ ,OH 
15 PD4 9 8 FINALMODEL :GLN 12 :N, CA, C, O, CB, CG, CD, OE1 ,NE2 

PD4 9 8 FINALMODEL : ASP 4 3 : N , CA , C , O , CB , CG , OD1 , OD2 
PD4 9 8 F IN ALMODEL : TYR 

44:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL : ASN 4 5 : N , CA , C , O , CB , CG , OD1 , ND2 
2 0 PD4 9 8 FINALMODEL : HI S 

46:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL : ASP 6 1 : N , CA , C , O , CB , CG , OD1 , OD2 
PD4 9 8 FINALMODEL : ARG 
62:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
25 PD4 9 8 FINALMODEL: ASP 63 : N, CA, C, O, CB, CG, OD1 , OD2 

PD4 9 8 FINALMODEL : ASN 6 5 : N , CA , C , O , CB , CG , OD1 , ND2 
PD498FINALMODEL: GLY 87:N,CA,C,0 

PD49 8 FINALMODEL : ILE 8 8 : N , CA , C , O , CB , CG 1 , CG2 , CD 1 

PD4 9 8 FINALMODEL: GLY 89:N,CA,C,0 
30 PD4 9 8 FINALMODEL: GLY 111:N,CA,C,0 

PD49 8 FINALMODEL : SER 1 1 2 : N , CA , C , O , CB , OG 

PD4 9 8 FINALMODEL : LEU 1 1 3 : N , CA , C , O , CB , CG , CD1 , CD2 

PD4 9 8 FINALMODEL: ASP 114 :N, CA, C, O, CB , CG, OD1 , OD2 

PD498FINALMODEL: ALA 117 :N, CA, C,0, CB 
35 PD498FINALMODEL: SER 118 :N,CA, C,0, CB,OG 

PD498 FINALMODEL : LEU 1 3 1 : N , CA , C , O , CB , CG , CD 1 , CD 2 

PD4 9 8 FINALMODEL : CYS 137 :N, CA, C,0, CB, SG 

PD4 9 8 FINALMODEL : GLU 
138:N,CA,C,0,CB,CG,CD,OEl,OE2 
40 PD4 9 8 FINALMODEL: CYS 139:N,CA,C,0,CB,SG 

PD4 9 8 FINALMODEL : VAL 158 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : VAL 1 5 9 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ARG 
169:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
45 PD4 9 8 FINALMODEL :THR 170 :N, CA, C, O, CB, OG1, CG2 

PD4 9 8 FINALMODEL : PHE 
171:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD498FINALMODEL : PRO 173 : N , CA, CD , C , O , CB , CG 

PD4 9 8 FINALMODEL: ALA 174 :N, CA, C,0, CB 
50 PD498FINALMODEL: ILE 180:N,CA,C,0,CB,CG1,CG2 ,CD1 

PD4 9 8 FINALMODEL: ALA 181 : N, CA, C, O, CB 

PD4 9 8 FINALMODEL : ASN 2 0 9 : N , CA , C , O , CB , CG , OD1 , ND2 

PD4 9 8 FINALMODEL : ALA 211 : N, CA, C, O, CB 

PD4 98 FINALMODEL : ASN 2 1 6 : N , CA , C , O , CB , CG , OD1 , ND2 
55 PD4 9 8 FINALMODEL : ASN 2 17 : N , CA , C , O , CB r CG , OD1 , ND2 

PD4 9 8 FINALMODEL: GLY 218:N,CA,C,0 
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PD4 9 8 FINALMODEL : TYR 

2 19 : N f CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
PD498FINALMODEL: SER 220 :N, CA,C,0,CB, OG 
PD4 98FINALMODEL : TYR 
5 221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD4 9 8 FINALMODEL : VAL 232 :N,CA, 0,0,08,001,062 
PD4 9 8 FINALMODEL : ALA 233 : N, CA, C, O, CB 
PD4 9 8 FINALMODEL :GLY 262:N,CA,C,0 
PD4 9 8 FINALMODEL :CA E282H:CA 

10 

Subset SUB5B: 

sub5bmole. list 
Subset SUB5B: 

PD498FINALMODEL:4-5,8, 13-16, 34-35, 47- 
15 51,53,64,83,85-86,90-91,120-124, 

PD4 9 8 FINALMODEL: 128-130 , 140-14 1 , 143-144 , 147- 
148,151-152,156-157, 

PD498FINALMODEL: 165, 167-168, 172, 175-176, 178- 

179,196,200-205,208, 
20 PD4 9 8 FINALMODEL: 234-237, 250, 253-254, 260-2 61, 263- 

267,272,E281H, 

PD4 9 8 FINALMODEL: E283H 

sub5batom. list 
25 Subset SUB5B: 

PD 4 98 FINALMODEL : ASN 4 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : ASP 5 : N , CA , C , O , CB , CG , OD1 , OD2 
PD4 9 8 FINALMODEL : TYR 
8:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

3 0 PD4 9 8 FINALMODEL : TYR 

13:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL :GLY 14:N,CA,C,0 
PD4 9 8 F I N ALMODEL : PRO 15 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL :GLN 16:N,CA,C,0,CB,CG,CD,OEl,NE2 

35 PD4 9 8 FINALMODEL : THR 34 :N, CA, C, O, CB, OG1 , CG2 

PD4 9 8 FINALMODEL : VAL 3 5 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 FINALMODEL : PRO 47 : N, CA, CD , C, O, CB , CG 
PD4 9 8 FINALMODEL : ASP 4 8 : N , CA , C , 0 , CB , CG , OD1 , OD2 
PD4 9 8 FINALMODEL : LEU 4 9 : N , CA , C , O , CB , CG , CD1 , CD2 

40 PD4 9 8 FINALMODEL : ALA 50:N,CA,C,O,CB 

PD4 9 8 FINALMODEL : ARG 

51:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL : VAL 53 : N , CA , C , O , CB , CGI , CG2 
PD4 98FINALMODEL : ASN 64 : N , CA , C , 0 , CB , CG , OD1 , ND2 

45 PD498FINALMODEL: ASP 83 :N,CA,C,0,CB,CG,ODl,OD2 

PD4 9 8 FINALMODEL : ASN 85 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : ASN 8 6 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : VAL 90 : N , CA , C , 0 , CB , CGI , CG2 
PD4 9 8 FINALMODEL: ALA 91:N, CA, C, 0,CB 

50 PD498FINALMODEL: ILE 120:N,CA,C,O,CB,CGl,CG2,CDl 

PD4 9 8 F IN ALMODEL : ARG 

121:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL : TYR 
122 : N , CA , C , O , CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ , OH 

55 PD4 9 8 FINALMODEL : ALA 123:N,CA,C,0,CB 

PD498FINALMODEL: ALA 124 : N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ALA 128:N,CA,C,0,CB 
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PD498FINALMODEL:LYS 129 : N, CA, C,0, CB, CG, CD, CE, NZ 
PD4 9 8 FINALMODEL : VAL 130 :N,CA, C,0, CB,CG1,CG2 
PD4 9 8 FINALMODEL : ASN 14 0:N,CA,C ,0,CB,CG,0D1,ND2 
PD4 9 8 F INALMODEL : SER 1 4 1 : N , OA , 0 , 0 , CB , OG 
5 PD 4 9 8 F INALMODEL : THR 143 :N,CA,C,0,CB,0G1,CG2 

PD 4 9 8 F INALMODEL : LEU 14 4 : N , OA , C , O , CB , CG , CD1 , CD 2 
PD4 9 8 FINALMODEL : ALA 147:N,CA / C,O r CB 
PD4 9 8 FINALMODEL : VAL 148 :N,CA, C,0, CB, CGI, CG2 
PD4 9 8 FINALMODEL : ALA 151:N,CA,C,0,CB 
10 PD 4 9 8 FINALMODEL : TRP 

52:N,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3, 

C22,CZ3,CH2 
PD4 9 8 F INALMODEL : ALA 156:N,CA,C,0,CB 
PD4 9 8 FINALMODEL: VAL 157 :N, CA, C, O, CB, CGI, CG2 
15 PD4 9 8 FINALMODEL: ASP 165:N,CA,C,0,CB,CG,ODl,OD2 

PD4 9 8 FINALMODEL : VAL 1 6 7 : N , CA , C , O , CB , CGI , CG2 
P D 4 9 8 F I N ALMO DEL: SER 168:N,CA,C,0,CB,OG 
PD498FINALMODEL: GLN 

172:N,CA,C,0,CB,CG,CD,0E1,NE2 
20 PD4 9 8 FINALMODEL: SER 175 :N,CA,C,0,CB,OG 

PD 4 9 8 FINALMODEL : T YR 

176:N,CA,C,0,CB,CG,CDl,CD2,CEl,CE2,CZ,OH 
PD4 9 8 FINALMODEL : ASN 1 7 8 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : ALA 17 9 : N , CA , C , 0 , CB 
25 PD4 9 8 FINALMODEL: ASN 1 9 6 : N , CA , C , O , CB , CG , 0D1 , ND2 

PD4 98FINALMODEL : TRP 

200:N,CA,C,O,CB,CG,CDl,CD2,NEl,CE2,CE3, 

CZ2,CZ3,CH2 
PD4 9 8 FINALMODEL: VAL 2 01 : N , CA , C , O , CB , CGI , CG2 

3 0 PD 4 9 8 FINALMODEL : ASP 2 02 : N , CA , C , O , CB , CG , 0D1 , 0D2 

PD4 9 8 F INALMODEL : VAL 2 03 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : THR 2 04 : N , CA , C , O , CB , OG1 , CG2 
PD498 FINALMODEL : ALA 2 05 : N , CA , C , O , CB 
PD4 9 8 FINALMODEL : VAL 208 :N,CA, C,0, CB, CG1,CG2 

35 PD4 9 8 FINALMODEL :GLY 234:N,CA,C,0 

PD4 9 8 FINALMODEL : LEU 2 3 5 : N , CA , C , O , CB , CG , CD1 , CD2 
PD4 9 8 FINALMODEL : ALA 236 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ALA 2 3 7 : N , CA , C , O , CB 
PD4 9 8 FINALMODEL : ARG 

40 250:N,CA,C,O,CB,CG,CD,NE,CZ,NHl,NH2 

PD4 9 8 FINALMODEL : ILE 253 : N , CA , C , O , CB , CGI , CG2 , CD1 
PD4 9 8FINALMODEL : GLU 

254:N,CA,C,0,CB,CG,CD,OEl,OE2 
PD4 9 8 FINALMODEL : ILE 2 6 0 : N , CA , C , O , CB , CGI , CG2 , CD1 

45 PD4 9 8 FINALMODEL: SER 261 :N, CA, C,0, CB,OG 

PD4 9 8 FINALMODEL : THR 2 6 3 : N , CA , C , O , CB , OG 1 , CG2 
PD4 9 8 FINALMODEL :GLY 264:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 265 :N, CA, C,0, CB,0G1 , CG2 
PD4 98FINALMODEL : ASN 266 :N,CA,C,0,CB,CG,0D1,ND2 

5 0 PD4 9 8 FINALMOD EL : PHE 

267:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
PD4 9 8FINALMODEL : ILE 2 7 2 : N , CA , C , O , CB , CGI , CG2 , CD1 
PD498FINALMODEL: CA E281H:CA 
PD4 9 8 FINALMODEL : CA E283H:NA 

55 

Subset ACTSITE: 

actsitemole, list 
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Subset ACTSITE: 

PD498FINALMODEL: 36-42 , 57-60 , 66-80 , 100-110 , 115- 
116, 119, 132-136, 160-164 , 

PD498FINALMODEL: 182-184 , 194 ,206-207 , 210 ,212- 
5 215,222-231 

actsiteatom. list 
Subset ACTSITE: 

PD4 9 8 FINALMODEL : ALA 36:N,CA,C,0,CB 

10 PD4 9 8 FINALMODEL : VAL 37 : N, CA, C, O, CB, CGI, CG2 

PD4 9 8 FINALMODEL : LEU 3 8 : N , CA , C , O , CB , CG , CD 1 , CD2 
PD4 9 8 FINALMODEL : ASP 3 9 : N , CA , C , O , CB , CG , OD1 , OD2 
PD498FINALMODEL: SER 4 0 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL : GLY 41:N,CA,C,0 

15 PD4 9 8 FINALMODEL : VAL 42 :N, CA, C,0, CB, CG1,CG2 

PD49 8 FINALMODEL : TYR 

57:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD498 FINALMODEL : ASP 5 8 : N , CA , C , O , CB , CG , OD 1 , OD 2 
PD 4 9 8 F I N ALMODEL : PHE 

20 59:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD4 9 8 FINALMODEL : ILE 60 : N , CA, C , O , CB , CGI , CG2 , CD1 
PD4 9 8 FINALMODEL : PRO 6 6 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL : MET 67 : N , CA , C , O , CB , CG , SD , CE 
PD4 9 8 FINALMODEL : ASP 6 8 : N , CA , C , O , CB , CG , OD1 , OD2 

25 PD4 9 8 FINALMODEL : LEU 69:N,CA,C,0,CB, CG,CD1,CD2 

PD4 9 8 FINALMODEL :ASN 70:N,CA,C,O,CB,CG,ODl,ND2 
PD4 9 8 FINALMODEL : GLY 71:N,CA,C,0 
PD4 9 8 FINALMODEL : HIS 

72:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 PD4 9 8 FINALMODEL: GLY 73:N,CA,C,0 

PD4 9 8 FINALMODEL : THR 7 4 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL: HIS 

75:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL: VAL 76:N,CA,C,0,CB,CG1,CG2 

35 PD4 9 8 FINALMODEL: ALA 77 :N, CA, C,0, CB 

PD4 9 8 FINALMODEL: GLY 78:N,CA,C,0 
PD4 9 8 FINALMODEL: THR 79:N,CA,C,0,CB,0G1, CG2 
PD 4 9 8 FINALMODEL : VAL 80 : N, CA, C , O , CB, CGI , CG2 
PD49 8 FINALMODEL : LEU 1 0 0 : N , CA , C , O , CB , CG , CD1 , CD 2 

40 PD4 9 8 FINALMODEL: ALA 101:N,CA,C,O,CB 

PD4 9 8 FINALMODEL : VAL 102 :N,CA, C,0,CB, CGI, CG2 
PD4 9 8 FINALMODEL : ARG 

103:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8FINALMODEL : VAL 104 : N , CA , C , O , CB , CGI , CG2 

45 PD4 9 8 FINALMODEL : LEU 105:N,CA,C,O,CB,CG,CDl,CD2 

PD4 9 8 FINALMODEL : ASP 1 0 6 : N , CA , C , 0 , CB , CG , OD1 , OD2 
PD4 9 8 FINALMODEL : ALA 107 :N,CA, C, 0,CB 
PD4 9 8FINALMODEL : ASN 1 08 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL: GLY 109:N,CA,C,0 

50 PD4 9 8 FINALMODEL: SER 110 : N, CA, C, 0, CB ,OG 

PD4 9 8 FINALMODEL : SER 1 15 : N , CA , C , 0 , CB , OG 
PD4 9 8FINALMODEL : ILE 1 1 6 : N , CA , C , O , CB , CGI , CG2 , CD 1 
PD4 9 8 FINALMODEL : GLY 119:N,CA,C,0 
PD4 9 8 FINALMODEL : ASN 13 2 : N , CA, C , O , CB , CG , OD1 , ND 2 

55 PD4 98 FINALMODEL: LEU 133 :N, CA, C, 0, CB, CG, CD1 , CD2 

PD4 9 8FINALMODEL : SER 1 3 4 : N , CA , C , 0 , CB , OG 
PD4 9 8 FINALMODEL : LEU 135 : N , CA, C , 0 , CB , CG , CD1 , CD2 
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PD4 9 8FINALMODEL : GLY 136:N,CA,C,0 
PD4 9 8FINALM0DEL : ALA 160 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ALA 161 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ALA 162 :N,CA,C,0,CB 
5 PD4 9 8 FINALMODEL : GLY 163:N,CA,C,0 

PD 4 9 8 FINALMODEL : ASN 1 6 4 : N , CA , C , O , CB , CG , OD 1 , ND 2 
PD4 9 8 FINALMODEL :VAL 182 :N, CA,C,0,CB , CGI , CG2 
PD4 9 8 FINALMODEL: GLY 183:N,CA,C,0 
PD4 9 8 FINALMODEL: ALA 184 :N,CA,C, 0,CB 
1 0 PD4 9 8 FINALMODEL : PHE 

194:N,CA,C,0,CB / CG,CD1,CD2,CE1,CE2,CZ 
PD4 9 8 FINALMODEL : PRO 2 06 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL: GLY 207:N,CA,C,O 

PD4 9 8 FINALMODEL :ILE 210 :N, CA,C, 0 / CB / CGl,CG2 ,CD1 
15 PD4 9 8 FINALMODEL :SER 212 :N, CA, C, O, CB, OG 

PD4 9 8 FINALMODEL : THR 2 13 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL : VAL 2 14 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : PRO 2 15 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL: MET 222 :N, CA, C, O, CB, CG, SD, CE 
20 PD4 9 8 FINALMODEL :SER 223 :N, CA, C,0, CB, OG 

PD4 9 8 FINALMODEL: GLY 224:N,CA,C,0 
PD4 9 8 FINALMODEL: THR 225 :N, CA, C, O, CB, OG1 , CG2 
PD4 9 8 FINALMODEL : SER 226:N,CA,C,0,CB,OG 
PD4 9 8 FINALMODEL : MET 2 2 7 : N , CA , C , O , CB , CG , SD , CE 
25 PD4 9 8 FINALMODEL: ALA 228 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL: SER 229 :N, CA, C, O, CB, OG 
PD4 9 8 FINALMODEL: PRO 230 :N , CA, CD , C , O, CB, CG 
PD498FINALMODEL:HIS 
231:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 

Subset RESTx: 

restxmole, list 
Subset RESTX: 

NEWMODEL: 6-7, 9-12, 43-46, 65,87- 
35 89,131,173,209,211,216-221,232-233, 
NEWMODEL : 2 6 2 , E2 8 2 H 

restxatom. list 
Subset RESTX: 
40 NEWMODEL : PRO 6:N, CA,CD, C,0,CB,pG 

NEWMODEL :TYR 
7:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
NEWMODEL: SER 9:N,CA,C,0,CB,0G 
NEWMODEL : ALA 10:N,CA,C,O,CB 
45 NEWMODEL :TYR 

1 1 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

NEWMODEL: GLN 12 :N, CA, C, O, CB, CG, CD, OE1 , NE2 
NEWMODEL : ASP 43 :N, CA, C, O, CB, CG,ODl, OD2 
NEWMODEL :TYR 
50 44:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

NEWMODEL: ASN 45:N,CA,C,0,CB,CG,0D1,ND2 
NEWMODEL: HIS 46:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
NEWMODEL : ASN 6 5 : N , CA , C , 0 , CB , CG , OD 1 , ND2 
NEWMODEL: GLY 87:N,CA,C,0 
55 NEWMODEL: ILE 88 :N, CA, C, O, CB, CGI, CG2 , CD1 

NEWMODEL: GLY 89:N,CA,C,0 

NEWMODEL : LEU 131 : N, CA, C, O, CB, CG, CD1 , CD2 



L 
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NEWMODEL : PRO 1 7 3 : N , CA , CD , C , O , CB , CG 
NEWMODEL : ASN 2 0 9 : N , CA r C,O f CB , CG , OD1 , ND2 
NEWMODEL : ALA 211:N,CA,C,0,CB 
NEWMODEL: ASN 2 16 : N , CA , C , O , CB , CG , OD1 , ND2 
5 NEWMODEL : ASN 2 17 : N , CA , C , O , CB , CG , OD1 , ND2 

NEWMODEL : GLY 218:N,CA,C,0 
NEWMODEL :TYR 
219:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
NEWMODEL : SER 220:N,CA,C,O,CB, OG 

10 NEWMODEL :TYR 

221:N,CA,C,0,CB,CG,CD1, CD2 , CE1, CE2 , CZ , OH 
NEWMODEL: VAL 232 : N, CA, C, O, CB, CGI, CG2 
NEWMODEL : ALA 233 :N,CA,C,0,CB 
NEWMODEL : GLY 262:N,CA,C,0 

15 NEWMODEL :CA E282H:CA 



Example 2 

Suitable substitutions in Savinase® for addition of amino 
20 attachment groups (-NH2I 

The known X-ray structure of Savinase® was used to find 
where suitable amino attachment groups may is added (Betzel et 
al, (1992), J. Mol. Biol. 223, p. 427-445). 

The 3D structure of Savinase® is available in the Brookhaven 
25 Databank as Isvn.pbd. A related subtilisin is available as 
lst3 .pdb. 

The sequence of Savinase® is shown in SEQ ID NO. 3 
The sequence numbering used is that of subtilisin BPN 1 , 
Savinase® having deletions relative to BPN 1 at positions: 36, 
30 56, 158-159 and 163-164. The active site residues (functional 
site) are D32,H64 and S221. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2 .bcl below: 



35 Conservative substitutions: 
makeKzone.bcl 
Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 
40 255,255,0 

Zone Subset NTERM :el:N Static monomer/ residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE : e32 , e64 , e221 Static monomer /residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union LYS NTEPM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 

#NOTE: editnextline object name according to the protein 
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Combine Subset REST Difference SAVI8 ALLZONE 
List Subset REST Atom Output File restatom. list 
List Subset REST monomer/ residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 List Subset ACTSITE Atom Output_File acts iteatom. list 
List Subset ACTSITE monomer /residue Output_File 
actsitemole . list 
# 

Zone Subset REST5A REST Static Monomer /Residue 5 -Color_Subset 
10 Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output File sub5batom. list 
List Subset SUB5B monomer /residue Output_File sub5bmole. list 
15 #Now identify sites for lys->arg substitutions and continue 
with makezone2.bcl 

#Use grep command to identify ARG in restatom. list , 
sub5batom. list & accsiteatom. list 

2 0 Comments : 

In this case of Savinase® REST contains the Arginines ArglO, 
Argl70 and Arg 186, and SUB5B contains Argl9, Arg45, Argl45 and 
Arg247. 

These residues are all solvent exposed. The substitutions 
25 R10K, R19K, R45K, R145K, R170K, R186K and R247K are identified 
in Savinase® as sites for mutagenesis within the scope of this 
invention. The residues are substituted below in section 2, 
and further analysis done. The subset ACTSITE contains Lys94 . 
The substitution K94R is a mutation removing Lysine as 
30 attachment group close to the active site. 



Non-conservative substitutions : 
makeKzone2 • bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
35 # 

#having scanned lists (grep arg command) and identified sites 
for lys->arg substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace SAVI8 newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On SAVI8 

#NOTE : editnextlines with lys->arg positions 

Replace Residue newmodel :el0 lys L 
45 Replace Residue newmodel :e 170 lys L 

Replace Residue newmodel :el8 6 lys L 

Replace Residue newmodel :el9 lys L 

Replace Residue newmodel :e45 lys L 

Replace Residue newmodel :el4 5 lys L 
50 Replace Residue newmodel :e2 41 lys L 
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#Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
5 Zone Subset LYSx newmodel: lys:NZ Static monomer/residue 10 
ColorJSubset 255,255,0 

Zone Subset NTERMx newmodel: el :N Static monomer/residue 10 
ColorJSubset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel :e3 2, e64,e2 21 Static 
monomer /residue 8 Color_Subset 255,255,0 
Combine Subset ALLZONEx Union LYSx NTERMx 
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
Combine Subset RESTx Difference newmodel ALLZONEx 
15 List Subset RESTx Atom Output File restxatom. list 

List Subset RESTx monomer /residue Output_File restxmole. list 

# ... 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 

List Subset ACTSITEx Atom Output_File actsitexatom. list 
20 List Subset ACTSITEx monomer /residue Output_File 
actsitexmole. list 
# 

#read restxatom. list or restxmole. list to identify sites for 
(not_arg) ->lys subst. if needed 

25 

Comments : 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 5,14,22,38-40,42,75-76,82,86,103-105,108,133- 
135,137,140,173,204,206,211-213,215-216,269. The following 
30 mutations are proposed in Savinase®: P5K, P14K, T22K, T38K, 

H39K, P40K, L42K, L75K, N76K, L82K, P86K, S103K, V104K, S105K, 
A108K, A133K, T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, 
G211K, S212K, T213K, A215K, S216K, N269K. 
Relevant data for Example 2: 

35 Solvent accessibility data for SAVINASE®: 

# SAVI8NOH20 Fri Nov 29 13:32:07 MET 1996 

# residue area 





ALA 


1 


118.362808 




gln" 


"2 


49.422764 


40 


SER~ 


"3 


61.982887 




VAL" 


"4 


71.620255 




pro" 


"5 


21.737535 




TRP" 


"6 


58.718731 




gly" 


"7 


4.328117 


45 


ile" 


"8 


6.664074 




ser" 


"9 


60.175900 




arg" 


"10 


70.928963 




val" 


"ll 


2.686934 




gln" 


~12 


72.839996 


50 


ALA" 


"13 


0.000000 




PRO 14 


52.308453 




ALA 


15 


38.300892 




ALA" 


"16 


0.000000 
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HISJL7 
ASN_18 
ARG_19 
GLY_20 
5 LEU_21 
THR_22 
GLY_23 
SER_24 
GLY_25 

10 VAL_26 
LYS_27 
VAL_28 
ALA_29 
VAL_30 

15 LEUJ31 
ASP_32 
THRJ33 
GLY_34 
ILE_35 

20 SER_36 
THR_37 
HIS_38 
PRO_39 
ASP_40 

25 LEU_41 
ASN_42 
ILE_43 
ARG_44 
GLY_45 

30 GLY_46 
ALA_47 
SER_48 
PHE_49 
VAL_50 

35 PRO_51 
GLY_52 
GLU_53 
PRO_54 
SER_55 

40 THR_56 
GLN_57 
ASP_58 
GLY_59 
ASN_60 

45 GLY_61 
HIS_62 
GLY_63 
THR_64 
HIS_65 

50 VAL_66 
ALA_67 
GLY_68 
THR_69 
ILEJ70 

55 ALAJ71 
ALAJ72 
LEU 73 



41.826324 

136.376602 

105.678642 

48.231510 

17.196377 

36.781742 

0.000000 

64.151276 

50.269905 

4.030401 

54.239555 

0.000000 

0.000000 

3.572827 

0.233495 

1.074774 

1.973557 

3.638052 

8.044439 

8.514903 

122.598907 

18.834011 

76.570526 

0.000000 

19.684013 

88.870216 

56.117710 

110.647194 

26.935413 

35.515778 

21.495472 

34.876190 

52.647541 

23.364208 

110.408752 

80.282906 

43.033707 

124.444336 

60.284889 

47.103241 

120.803505 

12.784743 

61.742443 

56.760231 

1.576962 

38.590118 

0.000000 

0.537387 

0.968253 

1.612160 

0.000000 

2.801945 

9.074596 

0.000000 

4.577205 

0.000000 

47.290039 
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ASN_74 
ASNJ75 
SERJ76 
ILEJ77 
5 GLYJ78 
VAL_79 
LEU_80 
GLY_81 
VAL_82 

10 ALA_83 
PRO_84 
SER_85 
ALA_86 
GLU_87 

15 LEU_88 
TYR_89 
ALA_90 
VAL_91 
LYS_92 

20 VAL_93 
LEU_94 
GLY_95 
ALA_96 
SER_97 

25 GLY_98 
SER_99 
GLY_100 
SERJL01 
VAL_102 

30 SER_103 
SER_104 
ILEJL05 
ALAJL06 
GLNJL07 

35 GLYJL08 
LEUJ.09 
GLU_110 
TRP_111 
ALAJL12 

40 GLY_113 
ASN_114 
ASN_115 
GLYJL16 
MET_117 

45 HIS_118 
VALJL19 
ALAJL20 
ASNJL21 
LEU_122 

50 SER_123 
LEU_124 
GLY_125 
SERJL26 
PRO_127 

55 SER_128 
PRO_129 
SER 130 



102.187248 

60.210400 

84.614494 

66.098572 

17.979534 

5.642561 

13.025185 

0.000000 

0.268693 

0.000000 

18. 193810 

56.839039 

13.075745 

37.011765 

2.149547 

30.633518 

I. 343467 
0.779450 
5.862781 
0.466991 
10.747736 
8.707102 
41.414677 
96.066040 
33.374485 
67.664116 
35.571117 
54.096992 
52.695324 
62.929684 
8.683097 
15.852910 
14.509443 
94.463066 
0.000000 
0.537387 
63.227707 
55.500740 
0.502189 

II. 908267 
107.208527 
78.811234 
41.453194 
9.634291 
54.022118 
5.105174 
0.268693 
0.233495 
0.537387 
4.004620 
21.927265 
55.952454 
40.241180 
107.409439 
57.988609 
85.021118 
20.460915 
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ALA 


131 


57.404362 




THR 


132 


74.438805 




LEU~ 


133 


12.091203 




GLU 


134 


73.382019 


5 


GLN 


135 


114.870010 




ALA 


136 


2.122917 




VAL 


137 


1.074774 




ASN_ 


138 


55.622704 




SER 


139 


29.174965 


10 


ALA~ 


140 


0.268693 




THR 


141 


27.962946 




SER 


142 


87.263145 




ARG 


143 


88.201218 




gly" 


144 


38.477882 


15 


VAL 


145 


2.079151 




LEU" 


"146 


13.703363 




VAL 


147 


2.690253 




VAL 


"148 


1.074774 




ALA 


149 


0.000000 


20 


ALA 


'150 


4.356600 




SER 


"151 


0.000000 




GLY" 


"152 


12.628590 




asn" 


"153 


84.248703 




ser" 


"154 


77.662354 


25 


gly" 


155 


25.409861 




ALA 


156 


38.074570 




GLY~ 


"157 


40.493744 




ser" 


"158 


53.915291 




ile" 


159 


4.352278 


30 


ser" 


"160 


12.458543 




tyr" 


"161 


29.670284 




pro" 


"162 


4.030401 




ALA" 


"163 


0.968253 




ARG" 


"164 


84.059120 


35 


tyr" 


"165 


28.641129 




ALA" 


"166 


68.193314 




ASN" 


"167 


61.686481 




ALA" 


'168 


0.537387 




MET" 


"169 


0.586837 


40 


ALA" 


"170 


0.000000 




VAL" 


"171 


0.000000 




GLY" 


"172 


0.000000 




ALA" 


"173 


0.933982 




THR" 


"174 


3.013133 


45 


asp" 


"175 


34.551376 




GLN 


"176 


96.873039 




ASN" 


177 


98.664368 




ASN 


"178 


41.197159 




asn" 


179 


60.263512 


50 


ARG" 


180 


64.416336 




ALA' 


"181 


7.254722 




ser" 


"182 


91.590881 




phe" 


~183 


52.126518 




ser" 


184 


2.101459 


55 


gln" 


"185 


15.736279 




tyr" 


"186 


44.287792 




gly" 


~187 


5.114592 
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ALA 


188 


69,406563 




GLY 


189 


36.926083 




LEU 


190 


16.511177 




ASP 


191 


7.705349 


5 


ILE 


192 


0.268693 




VAL~ 


193 


4.299094 




ALA 


194 


0.000000 




PRO 


195 


0.806080 




GLY 196 


0.000000 


10 


VAL 197 


25.257177 




ASN 


198 


82.177422 




VAL" 


199 


10.747736 




GLN~ 


200 


80.374527 




SER 


"2 01 


2.008755 


15 


THR 


202 


0.000000 




TYR~ 


203 


80. 679886 




PRO 


204 


34.632195 




GLY 


205 


74.536827 




SER 


206 


74.964920 


20 


THR 


"207 


57.070065 




TYR 


208 


82.895500 




ALA 


"209 


22.838940 




SER~ 


"210 


69.045639 




LEU 


"211 


49.708279 


25 


ASN 


"212 


86.905457 




GLY 


"213 


2.686934 




Tfflf 


"214 


4.669909 




SER~ 


215 


15.225292 




met" 


"216 


7.261287 


30 


ALA 217 


0.000000 




THR 218 


0.000000 




PRO 


219 


0.806080 




HIS" 


"220 


2.662697 




VAL" 


"221 


0.268693 


35 


ALA" 


"222 


0.000000 




gly" 


"223 


0.000000 




ALA" 


"224 


7.206634 




ALA" 


"225 


1.039576 




ALA" 


"226 


0.268693 


40 


LEU" 


"227 


1.074774 




val" 


"228 


1.541764 




LYS" 


"229 


39.262505 




gln" 


"230 


54.501614 




LYS~ 


"231 


81.154129 


45 


ASN" 


"232 


30.004124 




PRO" 


"233 


91.917931 




SER" 


"234 


102.856705 




TRP" 


"235 


64.639481 




SER' 


"236 


51.797619 


50 


ASN 


237 


24.866917 




VAL 


""238 


78.458466 




GLN 239 


73.981461 




ILE 


240 


14.474245 




ARG" 


"241 


41.242931 


55 


ASN 242 


64.644814 




HIS 


243 


50.671440 




LEU 


~244 


5.127482 



- WO 98/35026 



61 



PCT/DK98/00046 





LYS 


245 


48.820000 




ASN~ 


"246 


115.264534 




THR 


"247 


22.205376 




ALA_ 


"248 


16.415077 


5 


THR 


"24 9 


60.503101 




SER~ 


"250 


74.511597 




LEU" 


"251 


48.861599 




GLY 252 


39.124340 




SER 


"253 


49.811481 


10 


THR~ 


"254 


88.421982 




asn" 


"255 


72.490181 




leu"" 


"256 


54.835758 




tyr" 


"257 


38.798912 




gly~ 


"258 


3.620916 


15 


ser" 


"259 


35.017368 




gly" 


"260 


0.537387 




leu" 


"261 


8.598188 




VAL 262 


4.519700 




ASN 


263 


16.763659 


20 


ALA" 


"264 


3.413124 




GLU" 


"265 


37.942276 




ALA 266 


15.871746 




ALA 


267 


3.947115 




THR" 


"268 


2.475746 


25 


ARG~ 


"269 


176.743362 




ION" 


"270 


0.000000 




ion" 


"271 


5.197493 




Subset REST: 



restmole. list 



30 Subset REST: 

SAVI8 : E5-E15 , E17-E18 , E22 , E38-E40 , E42-E43 , E73-E76 , E82-E86 , E103 
E105, 

SAVI8 : E108-E109 , E111-E112 , E115-E116 , E122 , E128-E144 , E149- 
E150,E156-E157, 
35 SAVI8 : E160-E162 , E165-E168 , E170-E171, E173 , E180-E188 , E190- 
E192,E200, 

SAVI8 :E203-E204 , E206, E211-E213 , E215-E216, E227-E230 , E255- 
E259,E261-E262, 
SAVI8:E267-E269 
40 restatom. list 
Subset REST: 

SAVI8:PRO E5:N,CD,CA,CG,CB # C,0 

SAVI8:TRP E6:N,CA,CD2,CE2,NE1,CD1,CG / CE3,CZ3,CH2,CZ2,CB,C,0 
SAVI8:GLY E7:N,CA,C,0 
45 SAVI8:ILE E8:N,CA,CD1,CG1,CB,CG2 ,C,0 
SAVI8:SER E9:N,CA,0G,CB,C,O 

SAVI8:ARG E10:N,CA,NH2,NH1,CZ, NE , CD, CG, CB, C, O 
SAVI8:VAL Ell :N, CA, CG2 , CGI , CB, C, O 
SAVI8:GLN E12 :N, CA,NE2 ,OEl , CD, CG ,CB, C,0 
50 SAVI8:ALA E13 :N,CA,CB,C,0 

SAVI8:PRO E14 : N, CD, CA, CG, CB, C, O 
SAVI8:ALA E15:N,CA,CB,C,0 

SAVI8:HIS E17 :N, CA, CD2 , NE2 , CE1 ,ND1, CG, CB, C, O 
SAVI8:ASN E18:N,CA,ND2,0D1,CG,CB,C,0 
55 SAVI8:THR E22 : N, CA, CG2 , OG1, CB, C, O 
SAVI8:THR E38 : N, CA, CG2 ,0G1,CB, C, O 
SAVI8:HIS E39:N / CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
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SAVI8 : 


PRO 


E40:N,CD,CA,CG,CB,C,0 




SAVI8 : 


LEU 


E42:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


ASN 


E43:N,0A,ND2,0Dl,CG,CB,C,O 




SAVI8 : 


ALA 


E73:N,CA,CB,C,0 


5 


SAVI8 : 


ALA 


E74:N,CA,CB,C,0 




SAVI8 : 


LEU 


E75:N,CA,0D2,CDl,CG,CB,C,O 




SAVI8 : 


ASN 


E76:N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


LEU 


E82:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


GLY 


E83:N,CA,C,0 


10 


SAVI8 : 


VAL 


E84:N,CA,CG2,CG1,CB,C,0 




SAVI8 : 


ALA 


E85:N,CA,0B,C,O 




SAVI8 : 


PRO 


E86:N,CD,CA,CG,CB,C,0 




SAVI8 : 


SER 


E103: 


N,CA,OG,CB,C,0 




SAVI8 : 


VAL 


E104: 


N,CA,CG2 ,CG1,CB,C,0 


15 


SAVI8 : 


SER 


E105: 


N,CA, 00,06,0,0 




SAVI8 : 


ALA 


E108: 


N,CA,CB,C,0 




SAVI8 : 


GLN 


E109: 


N,CA,NE2,0E1,CD,CG,CB,C,0 




SAVI8 : 


LEU 


Elll: 


N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


GLU 


E112: 


N, CA,0E2 ,0E1 , CD, CG, CB, 0,0 


20 


SAVI8 ; 


GLY 


E115: 


N, OA, 0,0 




SAVI8 : 


ASN 


E116: 


N,CA,ND2,ODl,CG,CB,C,0 




SAVI8 : 


ALA 


E122: 


N,CA,CB,C,0 




SAVI8 : 


SER 


E128: 


N,CA,OG,CB,C,0 




SAVI8 : 


PRO 


E129: 


N,CD,CA,CG,CB,C,0 


25 


SAVI8 : 


SER 


E130: 


N,CA,0G,CB,C,0 




SAVI8 : 


PRO 


E131: 


N,CD,CA,CG,CB,C,0 




SAVI8 : 


SER 


E132: 


N,CA,OG,CB,C,0 




SAVI8 : 


ALA 


E133: 


N,CA,CB,C,0 




SAVI8 : 


THR 


E134: 


N,CA,CG2,0G1,CB,C,0 


30 


SAVI8 : 


LEU 


E135: 


N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


GLU 


E136: 


N,CA,0E2 ,OEl,CD,CG,CB,C,0 




SAVI8 : 


.GLN 


E137: 


N,CA,NE2,OE1,CD,CG,CB,C,0 




SAVI8 : 


ALA 


E138: 


N,CA,CB,C,0 




SAVI8 : 


:VAL 


E139: 


:N,CA,CG2,CG1,CB,C,0 


35 


SAVI8 : 


:ASN 


E140: 


:N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


:SER 


E141: 


:N,CA,OG,CB,C,0 




SAVI8 


: ALA 


E142 


:N,CA,CB,C,0 




SAVI8 


:THR 


E143" 


:N,CA,CG2,OG1,CB,C,0 




SAVI8 


:SER 


E144: 


:N,CA,0G,CB,C,0 


40 


SAVI8 


:VAL 


E149 


:N,CA,CG2,CG1,CB,C,0. 




SAVI8 


:VAL 


E150 


:N,CA,CG2,CG1,CB,C,0 




SAVI8 


:SER 


E156 


:N,CA,0G,CB,C,0 




SAVI8 


: GLY 


E157 


:N,CA,C,0 




SAVI8 


: ALA 


E160 


:N,CA,CB,C,0 


45 


SAVI8 


:GLY 


E161 


:N,CA,C,0 




SAVI8 


:SER 


E162 


:N,CA,0G,CB,C,0 




SAVI8 


:ILE 


E165 


:N,CA,CD1,CG1,CB,CG2,C,0 




SAVI8 


:SER 


E166 


:N,CA,OG,CB,C,0 




SAVI8 


:TYR 


E167 


:N,CA,OH,CZ,CD2,CE2,CEl,CDl,CG,CB,C,0 


50 


SAVI8 


:PR0 


E168 


:N,CD,CA,CG,CB,C,0 




SAVI8 


: ARG 


E170 


:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 




SAVI8 


:TYR 


E171:N,CA,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 




SAVI8 


:ASN 


E173 


:N,CA,ND2,ODl,CG,CB,C,0 




SAVI8 


:THR 


E180:N,CA,CG2,OG1,CB,C,0 


55 


SAVI8 


:ASP 


E181 


:N,CA,0D2,0D1,CG,CB,C,0 




SAVI8 


:GLN 


E182:N,CA,NE2,OEl,CD,CG,CB,C,0 




SAVI8 


:ASN 


E183:N,CA,ND2,OD1,CG,CB,C,0 
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SAVI8: 


ASN 


E184: 


N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8: 


ASN 


E185: 


N, 


CA,ND2 ,001,06,05,0,0 






SAVI8: 


ARG 


E186: 


N, 


CA,NH2,NH1,CZ,NE,CD,CG,CB,C, 


0 




SAVI8: 


ALA 


E187: 


N # 


CA,CB,C,0 




5 


SAVI8: 


SER 


E188: 


N, 


CA,0G,CB,C,0 






SAVI8: 


SER 


E190: 


N, 


CA,OG,CB,C,0 






SAVI8: 


GLN 


E191: 


N, 


OA , NE2 , 0E1 , CD , CG , CB , C , O 






SAVI8 : 


JTYR 


E192: 


N, 


CA,0H,CZ,CD2,CE2,CE1,CD1,CG, 


CB,C,0 




SAVI8 J 


: ALA 


E200: 


N, 


CA,CB,C,0 




10 


SAVI8 : 


:VAL 


E203: 


N| 


CA,CG2,CG1,CB,C,0 






SAVI8 < 


:ASN 


E204: 


N| 


CA,ND2,0D1,CG,CB,C,0 






SAVI8 ; 


: GLN 


E206i 


>Nj 


OA , NE2 , 0E1 , CD , CG ,08,0,0 






SAVI8 


:GLY 


E21U 


;N, 


CA,C,0 






SAVI8 


:SER 


E212: 


:N, 


CA,0G,CB,C,0 




15 


SAVI8 


:THR 


E213J 


:N, 


,CA,CG2,0G1,CB,C,0 






SAVI8 


: ALA 


E215: 


•N, 


CA,CB,C,0 






SAVI8 


:SER 


E216: 


:N, 


,CA,OG,CB,C,0 






SAVI8 


:VAL 


E227; 


:N, 


f CA,CG2,CGl,CB,C,0 






SAVI8 


: ALA 


E228; 


:N, 


,CA,CB,C,0 




20 


SAVI8 


:GLY 


E229: 


:N, 


,CA,C,0 






SAVI8 


: ALA 


E230: 


:N, 


,CA,CB,C,0 






SAVI8 


:THR 


E255: 


;n. 


, OA , CG2 , 0G1 , CB ,0,0 






SAVI8 


:SER 


E256 




,CA,OG,CB,C,0 






SAVI8 


:LEU 


E257 


:N 


, OA , CD2 , CD1 , CG , CB , C , 0 




25 


SAVI8 


:GLY 


E258 


:N 


, OA, 0,0 






SAVI8 


:SER 


E259 


:N 


,CA,OG,CB,C,0 






SAVI8 


:ASN 


E261 


:N 


,CA,ND2,0D1,CG,CB,C,0 






SAVI8 


:LEU 


E262 


:N 


r CA,CD2,CDl,CG,CB,C,0 






SAVI8 


:LEU 


E267 


:N 


r CA,CD2,CDl,CG,CB,C,0 




30 


SAVI8 


:VAL 


E268 


:N 


,CA,CG2,CG1,CB,C,0 






SAVI8 


:ASN 


E269 


:N 


r CA,ND2,0Dl,CG,CB,C,0 





Subset SUB5B: 

sub5bmole . list 
Subset SUB5B: 

35 SAVI8 : E2-E4 , E16 , E19-E21 , E23-E24 , E28 , E37 , E4 1 , E44-E45 , 
E77-E81,E87-E88, 

SAVI8 : E90 , E113-E114 , E117-E118 , E120-E121 , E14 5- 
E148 , E169 , E172 , E174-E176 , 
SAVI8:E193-E196,E198-E199,E214,E231- 
40 E234,E236,E243,E247,E250,E253-E254, 

SAVI8 : E260 , E2 63-E266 , E270-E273 , M276H-M277H 

sub5batom. list 
Subset SUB5B: 

SAVI 8 : GLN E2 : N , OA , NE2 , 0E1 , CD , CG , CB , C , O 
45 SAVI8:SER E3 :N,CA,OG,CB,C,0 

SAVI8:VAL E4 :N,CA,CG2,CG1,CB,C,0 
SAVI8:ALA E16:N,CA,CB,C,0 

SAVI8:ARG E19 :N, CA,NH2 , NH1, CZ ,NE, CD, CG, CB, C, O 

SAVI8:GLY E20:N,CA,C,O 
50 SAVI8:LEU E21 :N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8:GLY E23:N,CA,C,0 

SAVI8:SER E24 :N,CA,OG,CB,C,0 

SAVI8:VAL E28:N,CA,CG2,CG1,CB,C,0 

SAVI8:SER E37:N,CA,OG,CB,C,0 
55 SAVI8:ASP E41:N,CA,OD2,ODl,CG,CB,C,0 

SAVI8:ILE E44:N,CA,CD1,CG1,CB,CG2,C,0 

SAVI8:ARG E45:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
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10 



SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
15 SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
20 SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
25 SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
30 SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 



35 



40 



45 



50 



ASN E77:N,CA,ND2,0D1,CG,CB,C,0 
SER E78:N,CA,OG, CB,C,0 
ILE E79:N,CA,CD1,CG1, 03,002,0,0 
GLY E80:N,CA,C,0 
VAL E81:N,CA,CG2,CG1,CB,C,0 
SER E87:N,CA,0G,CB,C,0 
E88:N,CA,CB,C,0 
E90:N,CA,CD2,CDl,CG,CB,C,O 



55 



ALA 
LEU 

TRP E113 :N, CA, CD2 , CE2 , NE1, CD1, CG, CE3 , CZ3 , CH2 , CZ2 , CB, C, O 
ALA E114:N,CA,CB,C,0 
ASN E117:N,CA f ND2,0Dl,CG,CB,C,0 
GLY E118:N f CA,C,0 

HIS E120:N,CA,CD2,NE2,CE1,ND1, 00,08,0,0 
VAL E121:N,CA f CG2,CGl,CB,C,0 
ARG E145:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
GLY E146:N,CA,C,0 
VAL E147:N,CA, 062,001,06,0,0 
LEU E148:N,CA,CD2, 001,00,06,0,0 
ALA E169:N,CA,CB,C,0 
ALA E172:N,CA,CB,C,0 
ALA E174:N,CA,CB,C,0 
MET E175:N,CA,CE,SD,CG,0B,C,O 
ALA E176:N,CA,CB,C,0 
GLY E193:N,CA,C,0 
ALA E194:N,CA,CB,C,0 
GLY E195:N,CA,C,0 
LEU E196:N,CA,CD2,CD1,CG,CB,C,0 
ILE E198:N,CA,CD1,CG1,CB,CG2,C,0 
VAL E199:N,CA,CG2,CG1,CB,C,0 

TYR E214:N,CA,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 
ALA E231:N,CA,CB,C,0 
ALA E232:N,CA,CB,C,0 
LEU E233:N,CA,CD2,CD1,CG,CB,C,0 
VAL E234:N,CA,CG2,CG1,CB,C,0 
GLN E236:N,CA,NE2,OEl,CD,CG,CB,0,O 
ASN E243:N,CA,ND2,0D1,CG,CB,C,0 
ARG E247:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
LEU E250:N,CA,CD2,CDl,CG,CB,C,O 
THR E253:N,CA,0G2,OGl,CB,C,O 
ALA E254:N,CA,CB,C,0 
THR E260:N,CA,CG2,OG1,CB,C,0 

TYR E2 6 3 : N , CA , OH , CZ , CD2 , CE2 , CE1 , CD 1 , CG , CB , C , O 
GLY E264:N,CA,C,0 
SER E265:N,CA,OG,CB,C,0 
GLY E266:N,CA,C,0 
ALA E270:N,CA,CB,C,O 
GLU E271:N,CA,OE2,OEl,CD,CG,CB,C,0 
ALA E272:N,CA,CB,C,0 
ALA E273:N,CA,CB,C,0 
ION M276H:CA 
ION M277H:CA 
Subset ACTSITE: 

actsitemole. list 
Subs 6 t ACTSITE* 

SAVI8 : E29-E35 , E48-E51 , E54 , E58-E72 , E91-E102 , E106-E107 , E110 , E123- 
E127, 
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SAVI8 : E151-E155,E177-E179,E189,E201-E202,E205,E207-E210,E217- 
E226 



actsiteatom. list 
Subset ACTSITE: 





SAVI8: 


ALA 


E29: 


N 




SAVI8: 


VAL 


E30: 


N 




SAVI8 : 


LEU 


E31: 


N 




SAVI8: 


ASP 


E32: 


N 


10 


SAVI8: 


THR 


E33: 


N 




SAVI8: 


GLY 


E34: 


N 




SAVI8 : 


ILE 


E35: 


N 




SAVI8: 


ALA 


E48: 


N 




SAVI8: 


SER 


E49: 


N 


15 


SAVI8: 


PHE 


E50: 


N 




SAVI8: 


VAL 


E51; 


N 




SAVI8 : 


GLU 


E54: 


N 




SAVI8 : 


THR 


E58: 


N 




SAVI8: 


GLN 


E59: 


N 


20 


SAVI8 : 


ASP 


E60: 


N 




SAVI8 : 


GLY 


E61: 


N 




SAVI8 : 


ASN 


E62: 


N 




SAVI8 : 


GLY 


E63: 


.N 




SAVI8: 


HIS 


E64: 


:N 


25 


SAVI8: 


GLY 


E65: 


:N 




SAVI8 : 


THR 


E66' 


:N 




SAVI8 : 


HIS 


E67 


:N 




SAVI8 : 


VAL 


E68 


:N 




SAVI8: 


ALA 


E69 


:N 


30 


SAVI8: 


GLY 


E70 


:N 




SAVI8 : 


THR 


E71 


:N 




SAVI8 : 


ILE 


E72 


:N 




SAVI8 : 


TYR 


E91 


:N 




SAVI8 : 


ALA 


E92 


:N 


35 


SAVI8 : 


:VAL 


E93 


:N 




SAVI8: 


.LYS 


E94 


:N 




SAVI8: 


:VAL 


E95 


:N 




SAVI8: 


:LEU 


E96 


:N 




SAVI8 


: GLY 


E97 


:N 


40 


SAVI8 


: ALA 


E98 


:N 




SAVI8 


:SER 


E99 


:N 




SAVI8 


:GLY 


E100: 




SAVI8 


:SER 


Eioi: 




SAVI8 


:GLY 


E102: 


45 


SAVI8 


:SER 


E106: 




SAVI8 


:ILE 


E107: 




SAVI8 


:GLY 


E110: 




SAVI8 


:ASN 


E123: 




SAVI8 


:LEU 


E124: 


50 


SAVI8 


:SER 


E125: 




SAVI8 


:LEU 


E126: 




SAVI8 


: GLY 


E127: 




SAVI8 


: ALA 


E151: 




SAVI8 


: ALA 


E152: 


55 


SAVI8 


:SER 


E153: 




SAVI8 


: GLY 


E154: 




SAVI8 


:ASN 


E155: 



, CA,CB,C,0 
,CA,CG2,CG1,CB,C,0 
, CA , CD2 , CD 1 , CG , CB , C , O 
,CA,OD2,ODl,CG,CB,C,0 
,CA,CG2,0G1,CB,C,0 
,Ch,C,0 

, CA , CD1 , CGI , CB , CG2 , C , O 
,CA,CB,C,0 
,CA,OG, CB,C,0 

,CA,CD2,CE2,CZ,CE1,CD1,CG,CB,C,0 
,CA,CG2,CG1,CB,C,0 
,CA,OE2,OEl,CD,CG,CB,C,0 
,CA,CG2,0G1,CB,C,0 
,CA,NE2,0E1,CD,CG,CB,C,0 
,CA,OD2,OD:L,CG,CB,C,0 
,CA,C,0 

,CA,ND2,0D1,CG,CB,C,0 
,CA,C,0 

,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
,CA,C,0 

, CA,CG2,0G1 / CB,C,0 
,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
, CA r CG2 / CGl / CB / C / 0 
,CA,CB, C,0 
,CA,C,0 

,CA,CG2,0G1,CB,C,0 
,CA,CD1,CG1,CB, CG2,C,0 
,CA,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 

,CA, CB,C,0 
,CA,CG2,CG1,CB,C,0 
,CA,NZ,CE,CD,CG,CB,C,0 
,CA,CG2,CG1,CB,C,0 
,CA,CD2,CD1,CG,CB,C,0 
,CA,C,0 
,CA,CB,C,0 
,CA,OG,CB,C,0 
:N, CA, C,0 
:N,CA,OG,CB,C,0 
:N,CA,C,0 
:N,CA,OG,CB,C,0 
:N,CA,CD1,CG1,CB,CG2,C,0 
:N,CA,C,0 

:N,CA,ND2,ODl,CG,CB, C,0 
:N,CA,CD2,CD1,CG,CB,C,0 
: N , C A , OG , CB , C , O 
:N,CA,CD2,CD1,CG,CB,C,0 
:N,CA,C,0 
:N,CA,CB,C,0 
:N,CA,CB,C,0 
:N,CA,OG,CB,C,0 
:N,CA,C,0 

:N, CA,ND2,ODl,CG,CB,C,0 
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SAVI8:VAL E177 
SAVI8:GLY E178 
SAVI8:ALA E179 
SAVI8:PHE E189 
5 SAVI8:PRO E201 

SAVI8:GLY E202 
SAVI8:VAL E205 
SAVI8:SER E207 
SAVI8:THR E208 
10 SAVI8:TYR E209 

SAVI8:PRO E210 
SAVI8:LEU E217 
SAVI8:ASN E218 
SAVI8:GLY E219 
15 SAVI8:THR E220 

SAVI8:SER E221 
SAVI8:MET E222 
SAVI8:ALA E223 
SAVI8:THR E224 
20 SAVI8:PRO E225 

SAVI8:HIS E226 
Subset RESTx: 

restxmole . list 
Subset RESTX: 
25 NEWMODEL: E5 , E13-E14 , E22 , E38-E40 , 

E42, E73-E76,E82-E86,E103-E105, 
NEWMODEL: E108 , E122 , E133-E135 , E137-E140 , 

E149-E150 , E173 , E204 , E206 , 
NEWMODEL: E211-E213 , E215-E216, E227- E229 , 
30 E258,E269 
restxatom. list 
Subset RESTX: 

NEWMODEL : PRO E5:N, CD, CA,CG,CB,C,0 
NEWMODEL: ALA E13 :N,CA, CB, C,0 
35 NEWMODEL : PRO E14 :N, CD, CA, CG, CB, C,0 

NEWMODEL : THR E2 2 : N , CA , CG2 , OG1 , CB , C , 0 
NEWMODEL : THR E3 8 : N , CA , CG2 , OG 1 , CB , C , O 
NEWMODEL : HI S E39:N,CA,CD2 ,NE2 ,CE1,ND1,CG,CB,C,0 
NEWMODEL: PRO E40 :N, CD, CA, CG, CB, C, O 
40 NEWMODEL : LEU E42 :N, CA, CD2 , CD1 , CG, CB, C, O 

NEWMODEL : ALA E73 :N, CA, CB, C, O 
NEWMODEL : ALA E74 :N,CA,CB,C,0 
NEWMODEL: LEU E75 : N , CA, CD 2 , CD1 , CG , CB , C , O 
NEWMODEL: ASN E76 :N, CA,ND2 ,0D1, CG, CB, C, O 
45 NEWMODEL : LEU E82 :N, CA, CD2 , CD1 , CG, CB, C, O 

NEWMODEL: GLY E83:N,CA,C,0 
NEWMODEL : VAL E8 4 : N , CA , CG2 , CGI , CB , C , O 
NEWMODEL: ALA E85:N, CA, CB, C,0 
NEWMODEL : PRO E86 :N, CD, CA, CG, CB, C, O 
50 NEWMODEL: SER E103 : N, CA, OG, CB, C, 0 

NEWMODEL : VAL E104 : N , CA , CG2 , CGI , CB , C , O 
NEWMODEL: SER E105:N,CA,OG,CB,C,0 
NEWMODEL : ALA E108:N,CA,CB,C,0 
NEWMODEL : ALA E122:N,CA,CB,C,0 
55 NEWMODEL : ALA E133 :N,CA,CB,C,0 

NEWMODEL : THR E134 : N , CA , CG2 , OG1 , CB , C , O 
NEWMODEL: LEU E135:N,CA,CD2 ,CD1,CG,CB,C,0 



,UA, 

/CA, 
, CA, 
,CA, 
, CD, 
/CA, 
/CA, 
/CA, 
/CA, 
/CA, 
/CD, 
/CA, 
/CA, 
, CA, 
, CA, 
/CA, 
,CA, 
, CA, 
/CA, 
/CD, 
,CA, 



C,0 

CB,C,0 



,CD2,CE2,CZ, CE1,CD1,CG,CB,C,0 
,CA, CG,CB,C,0 
C,0 



,u,u 

,CG2,CG1,CB,C,0 
,OG,CB,C,0 
,CG2,0G1,CB,C,0 

,OH,CZ,CD2,CE2,CEl,CDl,CG r CB,C,0 
,CA,CG,CB,C,0 
r CD2,CDl,CG,CB,C,0 
,ND2,0D1,CG,CB,C,0 
r C,0 

r CG2,OGl,CB,C,0 
r OG,CB,C,0 
,CE,SD,CG,CB,C,0 
,CB r C,0 

,CG2,0G1,CB,C,0 
,CA r CG,CB,C,0 

,CD2 f NE2, CEl^Dl^G^B^^ 
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10 



15 



5 



NEWMODEL : GLN 
NEWMODEL : ALA 
NEWMODEL : VAL 
NEWMODEL : ASN 
NEWMODEL: VAL 
NEWMODEL: VAL 
NEWMODEL : ASN 
NEWMODEL: ASN 
NEWMODEL: GLN 
NEWMODEL :GLY 
NEWMODEL : SER 
NEWMODEL :THR 
NEWMODEL: ALA 
NEWMODEL: SER 
NEWMODEL: VAL 
NEWMODEL: ALA 
NEWMODEL :GLY 
NEWMODEL :GLY 
NEWMODEL: ASN 



E137 
E138 
E139 
E140 
E149 
E150 
E173 
E204 
E206 
E211 
E212 
E213 
E215 
E216 
E227 
E228 
E229 
E258 
E269 



N , CA , NE2 , OE1 , CD , CG , CB , C , O 

N,CA,CB, C,0 

N , CA , CG2 , CGI , CB , C , O 

N,CA,ND2,ODl, CG,CB, C,0 

N,CA,CG2 / CGl,CB,C / 0 

N , CA , CG2 , CGI , CB , C , O 

N,CA,ND2,ODl, CG,CB, C,0 

N,CA,ND2,ODl,CG,CB,C,0 

N , CA , NE2 , OE1 , CD , CG , CB , C , O 

N,CA,C,0 

N,CA,OG,CB,C,0 

N,CA,CG2,OGl,CB,C,0 

N,CA,CB,C,0 

N,CA,OG,CB,C,0 

N , CA , CG2 , CGI , CB , C , O 

N,CA,CB,C,0 

N,CA,C,0 

N , C A , C t O 

n!ca'nd2,odi,cg,cb,c,o 



20 



Example 3 

Suitable substitutions in PD498 for addition of carboxvlic acid 
attachment groups (-COOH) 

The 3D structure of PD498 was modeled as described in 
25 Example 1. 

Suitable locations for addition of carboxylic attachment groups 
(Aspartatic acids and Glutamic acids) were found as follows. 
The procedure described in Example 1 was followed. The 
commands performed in Insight (BIOSYM) are shown in the command 
30 files makeDEzone.bcl and makeDEzone2.bcl below: 



Conservative substutitions: 

makeDEzone.bcl 

Delete Subset * 

35 Color Molecule Atoms * Specified Specification 255,0,255 

Zone Subset ASP :asp:od* Static monomer /residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer/residue 10 Color_Subset 
255,255,0 

40 #N0TE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERM : 280:0 Static monomer /residue 10 Color_Subset 
255 255 0 

#NOTE: editnextline ACTSITE residues according to the protein 
45 Zone Subset ACTSITE : 39,72, 226 Static monomer /residue 8 

Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 

Combine Subset ALLZONE Union ALLZONE CTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 
50 #NOTE: editnextline object name according to the protein 

Combine Subset REST Difference PD4 9 8FINALMODEL ALLZONE 
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List Subset REST Atom Output_File restatom. list 
List Subset REST monomer /residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specif ication 255, 0, 0 
List Subset ACTSITE Atom Output_File act siteatom. list 
5 List Subset ACTSITE monomer/ residue Output_File 
actsitemole. list 
# 

Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 

10 Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File subSba torn. list 
List Subset SUB5B monomer/residue Output_File subSbmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 

15 . . • 

#continue with makezone2.bcl. 

#Use grep command to identify asn/gln in restatom. list ... 
#sub5batom. list & accsiteatom. list 

20 Comments: 

The subset REST contains Gln33 and Asn245, SUB5B contains 
Glnl2, Glnl26, Asn209, Gln242, Asn246, Gln248 and Asn266, all 
of which are solvent exposed. 

The substitutions Q12E or Q12D, Q33E or Q33D, Q126E or 
25 Q126D, N209D or N209E, Q242E or Q242D, N245D or N245E, N246D or 
N246E, Q248E or Q248D and N266D or N266E are identified in 
PD498 as sites for mutagenesis within the scope of this 
invention. Residues are substituted below in section 2, and 
further analysis done: 

30 

Non-conservative substitutions: 
makeDEzone2.be! 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for . . . 

#asn->asp & gln->glu substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace PD4 9 8 FINALMODEL newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On PD4 98 FINALMODEL 

#NOTE: editnextlines with asn->asp & gln->glu positions 

Replace Residue newmodel: 33 glu L 
45 Replace Residue newmodel: 245 asp L 

Replace Residue newmodel: 12 glu L 

Replace Residue newmodel: 126 glu L 

Replace Residue newmodel: 209 asp L 

Replace Residue newmodel: 242 glu L 
50 Replace Residue newmodel: 246 asp L 

Replace Residue newmodel: 248 glu L 
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Replace Residue newmodel: 266 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, — 
#now including introduced asp & glu 
5 Color Molecule Atoms newmodel Specified Specification 255,0,255 
Zone Subset ASPx newmodel: asp rod* Static monomer /residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel: glu :oe* Static monomer /residue 10 
Color_Subset 255,255,0 
10 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel : 280 :0 Static monomer/ residue 10 
Color_ Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
15 Zone Subset ACTSITEx newmodel : 39 , 72 , 226 Static monomer/residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
20 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output_File restxatom. list 

List Subset RESTx monomer/residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
25 List Subset ACTSITEx Atom Output^File actsitexatom. list 
List Subset ACTSITEx monomer /residue Output_File 
actsitexmole. list 

* . 

#read restxatom. list or restxmole. list to identify sxtes for 

30 (not_gluasp) ->gluasp . . . 

#subst. if needed 

Comments : 

The subset RESTx contains only two residues: A233 and G234, 

35 none of which are solvent exposed. No further mutagenesis is 

required to obtain complete protection of the surface. 

However, it may be necessary to remove some of the reactive 

carboxylic groups in the active site region to ensure access to 

the active site of PD498. Acidic residues within the subset 

40 ACTSITE are: D39, D58, D68 and D106. Of these only the two 

latter are solvent exposed and D39 is a functional residue. The 

mutations D68N, D68Q, D106N and D106Q were found suitable 

according to the present invention. 

Relevant data for Example 3: 

45 Solvent accessibility data for PD498MODEL: see Example 1 above. 

Subset REST: 

restmole. list 
Subset REST* 

PD498FINALMODEL: 10-11, 33-35, 54-55, 129-130, 

50 221,233-234,23 6,24 0,243, 

PD498FINALMODEL: 245, 262 ,264-265 
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restatom. list 

Subset REST: 

PD4 9 8 FINALMODEL : ALA 
5 PD4 9 8 FINALMODEL :TYR 

PD4 9 8 FINALMODEL : GLN 

PD498 FINALMODEL : THR 

PD4 9 8 FINALMODEL : VAL 

PD4 98 FINALMODEL : ILE 
10 PD 4 9 8 FINALMODEL : LY S 

PD4 9 8 FINALMODEL : LYS 

PD4 9 8 FINALMODEL : VAL 

PD4 9 8 FINALMODEL : TYR 

PD4 9 8 FINALMODEL : ALA 
15 PD4 9 8 FINALMODEL :GLY 

PD4 9 8 FINALMODEL : ALA 

PD4 9 8 FINALMODEL : ALA 

PD4 9 8 FINALMODEL :GLY 

PD4 9 8 FINALMODEL : ASN 
20 PD4 9 8FINALMODEL : GLY 

PD498FINALMODEL: GLY 

PD4 9 8 FINALMODEL : THR 
Subset SUB5B: 
subSbmole. list 
25 Subset SUB5B: 

PD4 9 8 FINALMODEL: 6-9 , 12-13 , 31-32 , 51-53 , 56 , 81,93-94 , 97- 

99 122 126—128 

PD498FINALMODEL: 131 ,155-157, 159 ,197-199 ,209 ,211, 219- 
220,232,235, 

30 PD498FINALMODEL: 237-239, 241-242, 244, 246-249, 253 , 2 60- 

261,263,266-268 
subSbatom. list 

Subset SUB5B: 

PD4 9 8 FINALMODEL : PRO 6 : N , CA , CD , C , O , CB , CG 
35 PD4 9 8FINALMODEL : TYR 7 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ ,OH 

PD4 9 8 FINALMODEL: TYR 8 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : SER 9 : N , CA , C , 0 , CB , OG 

PD4 9 8 FINALMODEL : GLN 12 : N , CA , C , O , CB , CG , CD , OE1 , NE2 

PD4 9 8 FINALMODEL : TYR 13 :N, CA, C,0, CB, CG, CD1 , CD2 , CE1, CE2 , CZ, OH 
40 PD498FINALMODEL: SER 3 1 : N , CA , C , O , CB , OG 

PD4 9 8FINALMODEL : THR 3 2 : N , CA , C , 0 , CB , OG1 , CG2 

PD4 9 8 FINALMODEL : ARG 51:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

PD4 9 8 FINALMODEL : LYS 5 2 : N , CA , C , O , CB , CG , CD , CE , NZ 

PD4 9 8 FINALMODEL : VAL 5 3 : N , CA , C , 0 , CB , CG 1 , CG2 
45 PD4 9 8 FINALMODEL: GLY 56:N,CA,C,0 

PD4 9 8 FINALMODEL : ALA 81 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL : MET 9 3 : N , CA , C , O , CB , CG , SD , CE 

PD4 9 8FINALMODEL : ALA 94:N,CA,C,0,CB 

PD 498 FINALMODEL : THR 97 : N , CA , C , 0 , CB , OG1 , CG2 
50 PD4 9 8 FINALMODEL : LYS 98:N,CA,C,0,CB,CG,CD,CE,NZ 

PD4 9 8 FINALMODEL : ILE 9 9 : N , CA , C , 0 , CB , CGI , CG2 , CD1 

PD4 9 8 FINALMODEL : TYR 12 2 : N , CA, C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : GLN 1 2 6 : N , CA , C , O , CB , CG , CD , OE1 , NE2 

PD4 9 8 FINALMODEL: GLY 127:N,CA,C,0 
55 PD498FINALMODEL: ALA 128 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL : LEU 1 3 1 : N , CA , C , O , CB , CG , CD 1 , CD2 

PD4 9 8 FINALMODEL : GLY 155:N,CA,C,0 



10:N,CA,C,O,CB 

11:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

33:N,CA,C,0,CB,CG,CD,0E1,NE2 

34:N,CA,C,0,CB,0G1,CG2 

35:N,CA,C,0,CB,CG1,CG2 

54:N,CA,C,0,CB,CG1,CG2,CD1 

55:N,CA,C,0,CB,CG,CD,CE,NZ 

129:N,CA,C,0,CB,CG,CD,CE,NZ 

13 0:N,CA,C,O,CB,CGl,CG2 

221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

233:N,CA,C,0,CB 

234:N,CA,C,0 

236:N,CA,C,0,CB 

240:N,CA,C,O,CB 

243:N,CA,C,0 

245:N,CA,C,0,CB,CG,0D1,ND2 
262:N,CA,C,0 
264:N,CA,C,0 
265:N,CA,C,0,CB,0G1,CG2 
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PD4 9 8 FINALMODEL : 


ALA 


156: 


N| 


CA, 


c, 


o, 


CB 






PD4 9 8 FINALMODEL : 


VAL 


157: 


N, 


CA, 


c, 


o, 


05,001,002 






PD4 9 8 FINALMODEL: 


VAL 


159: 


N, 


CA, 


c, 


o, 


CB, CG1,CG2 






PD4 9 8 FINALMODEL: 


TYR 


197: 


N, 


CA ; 


c, 


o, 


CB,CG,CD1,CD2,CE1, 


CE2 


, CZ , OH 


PD4 9 8 FINALMODEL : 


GLY 


198: 


N, 


CA # 


c, 


0 








PD 4 9 8 F I N ALMOD EL : 


THR 


199: 


N, 


CA # 


c, 


o, 


CB,0G1,CG2 






PD4 9 8 FINALMODEL: 


ASN 


209: 


»N, 


CA, 


c, 


o, 


CB,CG,0D1,ND2 






PD4 9 8 FINALMODEL: 


ALA 


211: 


k N, 


CA, 


c, 


0| 


CB 






PD4 9 8 FINALMODEL : 


TYR 


219: 


k N, 


CA # 


c, 


Oi 


CB,CG,CD1,CD2,CE1, 


CE2 


,CZ,OH 


PD4 9 8 FINALMODEL J 


SER 


220: 


;N, 


CA, 


c, 


o, 


CB,OG 






PD4 9 8 FINALMODEL : 


VAL 


232: 


•N, 


CA, 


c, 


o, 


CB,CG1,CG2 






PD4 9 8 FINALMODEL: 


LEU 


235: 


»N, 


CA, 


c, 


o. 


CB,CG,CD1,CD2 






PD4 9 8 FINALMODEL J 


ALA 


237: 


'N, 


CA, 


c, 


o ( 


CB 






PD4 9 8 FINALMODEL: 


LEU 


238: 


:N, 


CA, 


c, 


o ( 


CB,CG,CD1,CD2 






PD4 9 8 FINALMODEL : 


;LEU 


239: 


;N, 


CA, 


c, 


o, 


CB,CG,CD1,CD2 






PD4 9 8 FINALMODEL: 


:SER 


241 


;N, 


CA, 


c, 


o 4 


CB,OG 






PD4 9 8 FINALMODEL : 


:GLN 


242 


:N, 


CA, 


c, 


o, 


,CB,CG,CD,0E1,NE2 






PD4 98FINALMODEL : 


:LYS 


244 


:N, 


CA, 


c, 




f CB,CG,CD,CE,NZ 






PD4 9 8 FINALMODEL : 


:ASN 


246 


:N, 


CA, 






,CB,CG,ODl,ND2 






PD4 98 FINALMODEL: 


:VAL 


247 


:N, 


CA, 


c, 




,CB,CG1,CG2 






PD4 9 8 FINALMODEL: 


: GLN 


248 


:N, 


CA, 


c t 


o, 


,CB,CG,CD,0E1,NE2 






PD49 8 FINALMODEL J 


:ILE 


249 


:N, 


,CA, 


c, 




F CB,CG1,CG2,CD1 






PD4 9 8 FINALMODEL 


:ILE 


253 


:N, 


,CA, 


c t 




,CB,CG1,CG2,CD1 






PD4 9 8 FINALMODEL 


:ILE 


260 


:N ] 


,CA, 


c, 




,CB,CG1,CG2,CD1 






PD4 9 8 FINALMODEL 


:SER 


261 


:N, 


,CA, 






P CB,OG 






PD4 9 8 FINALMODEL 


: THR 


263 


:N 


rCA, 


c, 


,0 


,CB,0G1,CG2 






PD4 9 8 FINALMODEL 


:ASN 


266 


:N 


,CA, 


c, 


,0 


,CB,CG,ODl,ND2 






PD4 9 8 FINALMODEL 


:PHE 


267 


:N 


,CA, 




fO 


,CB,CG,CD1,CD2,CE1, 


CE2 


,CZ 


PD4 98 FINALMODEL 


:LYS 


268 


:N 


,CA 




fO 


,CB,CG,CD,CE,NZ 







30 Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

PD4 9 8 FINALMODEL: 3 6-42 , 57-60 , 66-80 , 100-110 , 
115-116,119,132-136,160-164, 
35 PD4 9 8 FINALMODEL : 182-184 , 194 , 206-207 , 210 , 

212-215,222-231 
actsiteatom. list 
Subset ACTSITE: 

PD4 9 8 FINALMODEL : ALA 36:N,CA,C,0,CB 
40 PD4 9 8 FINALMODEL : VAL 37 :N, CA, C,Q, CB, CGI, CG2 

PD4 9 8 FINALMODEL : LEU 38 : N, CA, C , O , CB , CG , CD1 , CD2 
PD4 9 8 FINALMODEL : ASP 3 9 : N , CA, C , O , CB , CG , 0D1 , 0D2 
PD4 9 8 FINALMODEL: SER 40:N,CA,C,0,CB,OG 
PD4 9 8 FINALMODEL: GLY 41:N,CA,C,0 
45 PD4 9 8 FINALMODEL: VAL 42:N,CA,C,0,CB,CG1,CG2 

PD4 9 8 FINALMODEL : TYR 

57:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL: ASP 58 :N,CA, C,0,CB, CG,0D1,0D2 
PD498 FINALMODEL : PHE 
50 59:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD4 9 8 FINALMODEL : ILE 6 0 : N , CA , C , O , CB , CG 1 , CG2 , CD 1 
PD4 9 8 FINALMODEL : PRO 66 : N , CA , CD , C , 0 , CB , CG 
PD498 FINALMODEL : MET 67 : N , CA , C , O , CB , CG , SD , CE 
PD4 9 8 FINALMODEL : ASP 6 8 : N , CA , C , O , CB , CG , 0D1 , 0D2 
5 5 PD4 9 8 FINALMODEL : LEU 6 9 : N , CA , C , O , CB , CG , CD 1 , CD2 

PD4 9 8 FINALMODEL : ASN 70 : N , CA , C , O , CB , CG , 0D1 , ND2 
PD4 9 8 FINALMODEL : GLY 71:N,CA,C,0 
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PD498 FINALMODEL : HI S 7 2 : N , CA , C , O , CB , CG , ND 1 , CD2 , CE1 , NE2 
PD4 9 8 F INALMODEL : GLY 73:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 74 : N, CA, C,0, CB,OGl , CG2 
PD4 9 8 FINALMODEL: HIS 75:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL :VAL 76 :N, CA, C, 0, CB, CGI , CG2 
PD4 9 8 FINALMODEL: ALA 77 :N, CA, C, 0, CB 
PD4 9 8 FINALMODEL: GLY 78:N,CA,C,0 
PD4 9 8 FINALMODEL: THR 79 :N, CA, C, 0, CB, 0G1 , CG2 
PD4 9 8 FINALMODEL :VAL 80 :N, CA, C, 0, CB, CGI , CG2 



PD4 9 8 FINALMODEL : LEU 


100: 


N, 


CA, 


c, 


o, 


CB, 


CG, 


CD1,CD2 


PD4 9 8 FINALMODEL : ALA 


101: 


N, 


CA, 


C| 


o, 


CB 






PD498 FINALMODEL : VAL 


102: 


N, 


CA, 


c, 


o, 


CB, 


CGI 


,CG2 


PD4 9 8 FINALMODEL : ARG 


103: 


N, 


CA, 


c, 


o, 


CB, 






CG,CD,NE,CZ,NH1 


,NH2 
















PD4 9 8 FINALMODEL : VAL 


104: 


N, 


CA, 


c, 


o, 


CB, 


CGI 


,CG2 


PD4 9 8 FINALMODEL : LEU 


105: 


N, 


CA, 


c, 


o, 


CB, 


CG, 


CD1,CD2 


PD4 9 8 FINALMODEL : ASP 


106: 


N, 


CA, 


c, 


o# 


CB, 


CG, 


ODl,OD2 


PD4 9 8 FINALMODEL : ALA 


107: 


N, 


CA, 


c, 


o, 


CB 






PD4 9 8 FINALMODEL : ASN 


108: 


N, 


CA, 


c, 


o, 


CB, 


CG, 


0D1,ND2 


PD498 FINALMODEL : GLY 


109: 


N, 


CA, 


c. 


0 








PD4 9 8 FINALMODEL : SER 


110: 


N, 


CA, 


c< 


o, 


CB, 


OG 




PD4 9 8 FINALMODEL: SER 


115: 


N, 


CA, 


c, 


o, 


CB, 


OG 




PD4 9 8 FINALMODEL : ILE 


116: 


N, 


CA, 


c, 


o, 


CB, 






CGI ,062,001 


















PD 4 9 8 FINALMODEL : GLY 


119: 


N, 


CA, 


c, 


r 0 








PD4 9 8 FINALMODEL : ASN 


132: 


N, 


CA, 


c. 




CB, 


CG, 


0D1,ND2 


PD49 8 FINALMODEL : LEU 


133: 


N, 


CA, 


c. 


rO, 


CB, 


CG, 


CD1,CD2 


PD4 9 8 FINALMODEL: SER 


134: 


N, 


CA, 


c 


r o, 


CB, 


OG 




PD 4 9 8 F I NALMODEL : LEU 


135: 


N, 


CA, 


c 




CB, 


CG, 


CD1,CD2 


PD 4 9 8 FINALMODEL : GLY 


136: 


N, 


CA, 


c 


f o 








PD4 9 8 FINALMODEL : ALA 


160: 


N, 


CA, 


c 




CB 






PD4 9 8 FINALMODEL : ALA 


161: 


N, 


CA, 


c 




CB 






PD49 8 FINALMODEL : ALA 


162: 


N, 


CA, 


c 




CB 






PD4 9 8 FINALMODEL : GLY 


163: 


N, 


CA, 


c 


r o 








PD4 9 8 FINALMODEL : ASN 


164: 


N, 


CA, 


c 




rCB, 


CG, 


0D1,ND2 


PD4 9 8 FINALMODEL : VAL 


182: 


,N, 


CA, 


c 


,0 


, CB, 


CGI 


,CG2 


PD4 9 8 FINALMODEL : GLY 


183: 


:N, 


,CA, 


c 


rO 








PD4 9 8 FINALMODEL : ALA 


184: 


;N, 


,CA, 


c 


rO 


r CB 






PD4 9 8 FINALMODEL : PHE 


194: 


:N, 


,CA, 


c 


,o 


rCB, 






CG,CD1,CD2,CE1, 


CE2, 


CZ 














PD498FINALMODEL: PRO 


206: 


:N, 


rCA, 


,CD,C,0, 


CB, 


CG 


PD 4 9 8 FINALMODEL : GLY 


207:N, 


rCA, 


,c 


,0 








PD498FINALMODEL: ILE 


210:N 


,CA, 


rC 


.0 


,CB, 
























PD4 9 8FINALM0DEL : SER 


212 


:N 


,CA 


rC 


,o 


,CB, 


OG 




PD4 9 8 FINALMODEL : THR 


213 


:N 


r CA 




,o 


rCB, 


0G1 


,CG2 


PD4 9 8FINALM0DEL : VAL 


214 


:N 


r CA 


r c 


,0 


,CB, 


CGI 


,CG2 


PD4 98FINALM0DEL : PRO 


215 


:N 


,CA 


, CD , C , 0 , 


CB, 


CG 


PD4 9 8 FINALMODEL : MET 


222 


:N 


,CA 




,0 


,CB, 


CG, 


SD,CE 


PD498FINALMODEL: SER 


223 


:N 


f CA 


r c 


,0 


,CB, 


OG 




PD4 9 8 FINALMODEL : GLY 


224 


:N 


,CA 


rC 


,0 








PD4 9 8 FINALMODEL : THR 


225 


:N 


,CA 


f c 


,0 


rCB, 


0G1,CG2 


PD4 9 8 FINALMODEL: SER 


226 


:N 


,CA 


f c 


,0 


,CB, 


OG 




PD4 9 8 FINALMODEL : MET 


227 


:N 


r CA 


r c 


,0 


,CB, 


CG, 


SD,CE 


PD498 FINALMODEL : ALA 


228 


:N 


,CA 


,c 


,0 


,CB 






PD4 9 8 FINALMODEL: SER 


229 


:N 


,CA 


,c 


,0 


,CB, 


OG 




PD49 8 FINALMODEL : PRO 


230 


:N 


,CA 


, CD, 


c:,o, 


CB, 


CG 
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PD4 9 8 FINALMODEL : HIS 231 : N, CA, C, O, CB, 
CG,ND1,CD2, CE1 / NE2 
Subset RESTx: 

restxmole. list 
5 Subset RESTX: 

NEWMODEL: 233-234 
restxa torn. list 
Subset RESTX: 

NEWMODEL : ALA 233:N,CA,C,0,CB 
10 NEWMODEL : GLY 234:N,CA,C,0 

Example 4 

Suitable substitutions in the Arthromyces ramosus peroxidase 
for addition of carboxvlic acid attachment g roups (-COOH) 
15 Suitable locations for addition of carboxylic attachment 
groups (Aspartatic acids and Glutamic acids) in a non- 
hydrolytic enzyme, Arthrojnyces ramosus peroxidase were found as 
follows. 

The 3D structure of this oxido-reductase is available in the 
20 Brookhaven Databank as larp.pdb. This A. ramosus peroxidase 
contains 344 amino acid residues. The first eight residues are 
not visible in the X-ray structure: QGPGGGGG, and N143 is 
glycosylated. 

The procedure described in Example 1 was followed. 
25 The amino acid sequence of Arthrojnyces ramosus Peroxidase 

(E.C.I. 11. 1.7) is shown in SEQ ID NO 4. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeDEzone.bcl and makeDEzone2 .bcl below. The c- 

terminal residue is P344, the ACTSITE is defined as the heme 
30 group and the two histidines coordinating it (H56 & H184) . 

Conservative substitutions: 

makeDEzone.bcl 

Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
35 Zone Subset ASP : asp :od* Static monomer /residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline C-terminal residue number according to the 
40 protein 

Zone Subset CTERM : 344:0 Static monomer/residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :HEM,56,184 Static monomer/residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 
Combine Subset ALLZONE Union ALLZONE CTERM 
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Combine Subset ALLZONE Union ALL ZONE ACTSITE 
#NOTE: editnextline object name according to the protein 
Combine Subset REST Difference ARP ALLZONE 
List Subset REST Atom Output File restatom. list 
5 List Subset REST monomer /residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output^File actsiteatom. list 
List Subset ACTSITE monomer/ residue Output_File 
actsitemole . list 
10 # 

Zone Subset REST5A REST Static Monomer /Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
15 List Subset SUB5B Atom Output File sub5batom. list 

List Subset SUB5B monomer /residue Output_File sub5bmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 
* • • 

#continue with makezone2 . bcl. 
20 #Use grep command to identify asn/gln in restatom. list . ♦. 
#sub5batom. list & accsiteatom. list 

Comments : 

The subset REST contains Gln70, and SUB5B contains Gln34, 
25 Asnl28, Asn303 all of which are solvent exposed. The 

substitutions Q34E or Q34D, Q70E or Q70D, N128D or N128E and 
N3 03D or N3 03E are identified in A. ramosus peroxidase as sites 
for mutagenesis. Residues are substituted below and further 
analysis done: 

30 

Non-conservative substitutions: 
makeDEzone2 .bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for . . . 

#asn->asp & gln->glu substitutions 

#N0TE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace ARP newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On ARP 

#NOTE: editnext lines with asn->asp & gln->glu positions 
Replace Residue newmodel: 34 glu L 
45 Replace Residue newmodel: 70 glu L 
Replace Residue newmodel: 128 asp L 
Replace Residue newmodel: 303 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, . . . 
50 #now including introduced asp & glu 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
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Zone Subset ASPx newmodel rasp rod* Static monomer /residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel :glu roe* Static monomer/ residue 10 
Color_Subset 255,255,0 
5 #NOTE r editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel r 344 rO Static monomer /residue 10 
Color_Subset 255,255,0 

#NOTE r editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel r HEM, 56, 184 Static monomer /residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output File restxatom. list 

List Subset RESTx monomer/ residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
20 List Subset ACTSITEx Atom Output^File actsitexatom. list 
List Subset ACTSITEx monomer /residue Output_File 
actsitexmole . list 
# 

#read restxatom. list or restxmole. list to identify sites for 
25 (not_gluasp) ->gluasp . . . 
#subst. if needed 

Comments r 

The subset RESTx contains only four residuesr S9, S334, G335 

30 and P336, all of which are >5% solvent exposed. The mutations 
S9D, S9E, S334D, S334E, G335D, G335E, P336D and P336E are 
proposed in A. ramosus peroxidase. Acidic residues within the 
subset ACTSITE arer E44, D57, D77, E87, E176, D179, E190, D202, 
D209, D246 and the N-terminal carboxylic acid on P344. Of these 

35 only E44, D77, E176, D179, E190, D209, D246 and the N-terminal 
carboxylic acid on P344 are solvent exposed. Suitable sites for 
mutations are E44Q, D77N, E176Q, D179N, E190Q, D209N and D246N. 
D246N and D246E are risky mutations due to D246's importance 
for binding of heme. 

40 The N-terminal 8 residues were not included in the 

calculations above, as they do not appear in the structure. 
None of these 8 residues, QGPGGGG, contain carboxylic groups. 
The following variants are proposed as possible mutations to 
enable attachment to this region r Q1E, Q1D, G2E, G2D, P3E, P3D, 

45 G4E, G4D, G5E, G5D, G6E, G6D, G7E, G7D, G8E, G8D. 
Relevant data for Example 4r 
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Solvent accessibility data for A. ramosus peroxidase (Note: 
as the first eight residues are missing in the X-ray structure, 
the residue numbers printed in the accessibility list below are 
8 lower than those used elsewhere for residue numbering. 



5 


# ARP 


Thu Jan 30 15:39:05 MET 1997 




# residue 


area 




SER 1 


143.698257 




VAL 2 


54.879990 




THR 3 


86.932701 


10 


CYS 4 


8.303715 




PRO 5 


126.854782 




GLY 6 


53.771488 




GLY 7 


48.137802 




GLN 8 


62.288475 


15 


SER 9 


79.932549 




THR 10 


16.299215 




SER 11 


81.928642 




ASN 12 


51.432678 




SER 13 


81.993019 


20 


GLN 14 


92.344009 




CYS 15 


0.000000 




CYS 16 


32.317432 




VAL 17 


54.067810 




TRP 18 


6.451035 


25 


PHE 19 


25.852070 




ASP 20 


79.033997 




VAL 21 


0.268693 




LEU 22 


22.032858 




ASP 23 


90.111404 


30 


ASP 24 


43.993240 




LEU 25 


1.074774 




GLN 26 


25.589321 




THR 27 


82.698059 




ASN 28 


96.600883 


35 


PHE 29 


32.375275 




TYR 30 


5.898365 




GLN 31 


103.380585 




GLY 32 


40.042034 




SER 33 


46.789322 


40 


LYS 34 


87.161873 




CYS 35 


12.827215 




GLU 36 


51.582657 




SER 37 


16.378180 




PRO 38 


33.560043 


45 


VAL 39 


6.448641 




ARG 40 


7.068311 




LYS 41 


15.291286 




ILE 42 


1.612160 




LEU 43 


1.880854 


50 


ARG 44 


16.906845 




ILE 45 


0.000000 




VAL 46 


2.312647 




PHE 47 


2.955627 




HIS 48 


20.392527 


55 


ASP 49 


4.238116 
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ALA_50 
ILE_51 
GLY_52 
PHE_53 
5 SER_54 
PRO_55 
ALA_56 
LEU_57 
THR_58 

10 ALA_59 
ALA_60 
GLY_61 
GLN_62 
PHE_63 

15 GLY_64 
GLY_65 
GLY_66 
GLY_67 
ALA_68 

2 0 ASP_69 
GLYJ70 
SERJ71 
ILE_72 
ILEJ73 

25 ALAJ74 
HIS_75 
SERJ76 
ASNJ77 
ILEJ78 

30 GLUJ79 
LEU_80 
ALA_81 
PHE_82 
PRO_83 

35 ALA_84 
ASN_85 
GLY_86 
GLY_87 
LEU_88 

40 THR_89 
ASP_90 
THR_91 
ILE_92 
GLU_93 

45 ALA_94 
LEU_95 
ARG_96 
ALA_97 
VAL_98 

50 GLY_99 
ILE_100 
ASN_101 
HIS_102 
GLY_103 

55 VALJL04 
SER_105 
PHE 106 



0.510757 

1.576962 

2.858601 

48.633503 

8.973248 

58.822315 

59.782852 

46.483955 

86.744827 

89.515816 

81.163239 

70.119019 

112.635498 

93.522354 

2.742587 

13.379636 

22.722847 

0.000000 

0.268693 

12.074840 

0.700486 

0. 000000 

0.000000 

0. 000000 

17.304443 

41.071186 

20.000793 

120.855316 

66.574982 

2.334954 

41.329689 

77.370575 

38.758774 

131.946289 

34.893864 

5.457000 

43.364151 

51.561348 

0.242063 

73.343575 

130.139389 

17.863211 

0.268693 

92.210396 

35.445068 

1.343467 

31.175611 

44.650192 

17.698566 

I. 471369 
62.441463 
107.139748 
46.952496 
46.559296 

II. 342628 
15.225677 
6.422011 
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GLY 


107 


3.426864 




ASP 


108 


10.740790 




LEU 


109 


0.268693 




ILE 


110 


1.880854 


5 


GLN 


111 


31.867456 




PHE 


112 


0.000000 




ALA 


113 


0.000000 




THR 


114 


3.656114 




ALA 


115 


8.299393 


10 


VAL 


116 


0.268693 




GLY~ 


117 


0. 268693 




met" 


118 


3.761708 




SER~ 


119 


14.536770 




ASN~ 


120 


25.928799 


15 


CYS~" 


121 


0. 537387 




pro"" 


122 


29.798336 




GLY 


"123 


33.080013 




SER 


124 


17.115562 




PRO 


"125 


36.908714 


20 


arg" 


"126 


108.274727 




LEU 


"127 


21.238588 




GLU 


"128 


53.742313 




PHE 


"129 


3.761708 




LEU 


130 


12.928699 


25 


THR 


'131 


10.414591 




GLY 


132 


47 . 266495 




ARG~ 


"133 


12 .247048 




ser" 


"134 


63 .047237 




ASN 135 


31.403708 


30 


SER 


136 


97 .999619 




SER*" 


"137 


28.505201 




GLN" 


"138 


102.845520 




PRO" 


"139 


49.691917 




ser" 


"140 


9.423104 


35 


pro" 


"141 


25.724171 




pro" 


"142 


80.706665 




ser" 


"143 


105.318176 




leu" 


"144 


20.154398 




ile" 


"145 


41.288322 


40 


pro" 


"146 


10.462679 




gly" 


"147 


19.803421 




PRO* 


"148 


18.130360 




GLY" 


"149 


47.391853 




asn" 


"150 


60.248917 


45 


THR" 


"151 


87.887985 




VAL" 


"152 


13.870322 




THR" 


"153 


74.664734 




ALA" 


"154 


45.251106 




ILE" 


"155 


2.686934 


50 


LEU 


"156 


28.720940 




asp" 


"157 


110.081253 




arg" 


158 


31.228874 




met" 


159 


1.612160 




gly" 


"160 


38.223858 


55 


asp" 


"161 


46.293152 




ALA" 


"162 


9.877204 




GLY" 


"163 


34.267326 
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PHE 


164 


11.057570 




SER~ 


165 


51.158882 




pro"" 


166 


62.767738 




ASP~ 


167 


75.164917 


5 


GLU 


168 


43.334976 




VAL 


169 


6.365355 




val" 


170 


2.955627 




ASP 


171 


7.004863 




LEU 


172 


1.880854 


10 


LEU" 


173 


3.197691 




ALA" 


174 


0.000000 




ALA" 


175 


1.074774 




HIS 


176 


0.502189 




SER 


177 


0.806080 


15 


LEU 


178 


3.197691 




ALA" 


179 


3.337480 




SER" 


180 


0.466991 




GLN 


181 


2. 122917 




GLU" 


"182 


40.996552 


20 


GLY 


"183 


62.098671 




leu" 


184 


23.954853 




ASN~ 


185 


15.918136 




SER 


186 


95.185318 




ALA" 


187 


59.075272 


25 


ILE 188 


27.675419 




PHE 189 


102.799423 




ARG 


190 


55.265549 




SER" 


"191 


6.986028 




PRO 


'192 


2.686934 


30 


LEU" 


"193 


12.321225 




asp" 


"194 


2.127163 




SER" 


"195 


33.556419 




THR~ 


"196 


33.049286 




pro"" 


197 


20.874798 


35 


GLN" 


"198 


65.729698 




VAL" 


"199 


31.705818 




PHE*" 


"200 


4.753195 




ASP*" 


"201 


13.744506 




thr" 


'202 


1.612160 


40 


GLN" 


"203 


16.081930 




PHE" 


"204 


2.581340 




TYR" 


"205 


1.880854 




ile" 


"206 


9.356181 




GLU" 


"207 


0.735684 


45 


thr" 


"208 


10.685907 




LEU" 


"209 


9.672962 




LEU" 


"210 


2.955627 




LYS" 


"211 


77.176834 




GLY" 


"212 


40.968609 


50 


Tffif 


"213 


78.718216 




thr" 


"214 


21.738384 




GLN' 


"215 


77.622299 




PRO" 


"216 


25.441587 




GLY" 


"217 


8.320850 


55 


PRO" 


"218 


96.972305 




SER" 


"219 


64.627823 




LEU" 


"220 


85.732414 
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GLY 


221 


27-361111 




PHE 


"222 


134.620178 




ALA 


223 


3.873014 




GLU 


224 


12.141763 


5 


GLtT 


225 


65.129868 




LEU 


226 


76.105843 




SER 


227 


0.268693 




pro" 


228 


7.017754 




phe" 


229 


0.000000 


10 


PRO 230 


47.827423 




GLY 


231 


23.790522 




GLU" 


232 


6.643466 




PHE 


233 


6.713862 




ARG 


"234 


18.012030 


15 


MET 


"235 


4.598188 




ARG 


"236 


91.415581 




SER 


'237 


1.982125 




ASP" 


"238 


6.246871 




ALA" 


'239 


12.897283 


20 


LEU" 


"240 


76.820526 




LEU" 


"241 


3.224321 




ALA 


242 


1.400973 




ARG" 


"243 


77.207176 




ASP" 


"244 


36.207306 


25 


ser" 


'245 


104. 023796 




arg" 


"246 


121.852341 




thr" 


"247 


2.955627 




ALA" 


"248 


4.810700 




CYS" 


'249 


47.331306 


30 


ARG" 


"250 


62.062778 




TRP 251 


2.418241 




GLN 


252 


5.554953 




SER" 


"253 


38.284832 




MET" 


"254 


1.124224 


35 


thr" 


"255 


0.000000 




ser" 


"256 


53.758987 




SER 


"257 


37.276134 




asn" 


'258 


44.381340 




GLU~ 


"259 


149.565140 


40 


VAL~ 


"260 


57.500389 




met" 


"261 


2.679314 




GLY~ 


"262 


10.175152 




GLN" 


"263 


107.458916 




ARG" 


264 


36.402130 


45 


tyr" 


"2 65 


0.233495 




arg" 


"2 66 


91.179619 




ALA" 


"2 67 


53.708500 




ALA" 


"268 


6.504294 




MET" 


'269 


17.122011 


50 


ALA" 


"270 


22.455158 




LYS" 


271 


73.386177 




MET 


"272 


3.959508 




SER" 


"273 


15.043281 




val" 


274 


23.887930 


55 


LEU 


"275 


17.196379 




GLY" 


276 


44.362202 




phe" 


"277 


68.062485 
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ASP 


278 


94,902039 




ARG~ 


279 


113.549011 




ASN~ 


280 


134.886017 




ALA 281 


72.340973 


5 


LEU 282 


26.692348 




THR 


283 


27.696728 




ASP 


'284 


72.214157 




CYS 285 


0.000000 




SER 


286 


28.209335 


10 


ASP 


287 


64.560753 




VAL~ 


'288 


7.040061 




ILE" 


'289 


8.665112 




pro" 


"290 


48.682365 




SER 


'291 


86.141670 


15 


ALA"" 


'292 


29.031240 




VAL 


"293 


84.432014 




SER 


'294 


85.944153 




asn" 


295 


49.017288 




asn" 


"296 


133.459198 


20 


ALA 


"297 


57.283794 




ALA 


298 


65.233749 




PRO" 


"299 


24.751518 




val" 


"300 


45.409184 




ile" 


"3 01 


8.060802 


25 


pro" 


"302 


14.742939 




gly" 


"303 


16.589832 




gly" 


"304 


34.238071 




leu" 


'305 


24.719791 




thr" 


"306 


49.356300 


30 


val 


"307 


71.491821 




asp" 


"308 


130.906174 




asp" 


"309 


31.733070 




ile" 


"310 


19.581894 




GLU" 


"311 


81.414574 


35 


val" 


"312 


94.769890 




SER 


"313 


39.688896 




CYS" 


"314 


9.998511 




pro" 


"315 


120.328018 




SER" 


"316 


95.364319 


40 


GLU 


"317 


65.560959 




PRO" 


"318 


100.254364 




PHE" 


"319 


46.284115 




PRO" 


"320 


31.328060 




GLU" 


"321 


177.602249 


45 


ile" 


"322 


33.449741 




ALA" 


"323 


46.892982 




THR" 


"324 


79.976471 




ALA" 


"325 


36.423820 




SER" 


"326 


124.467422 


50 


gly" 


"327 


28.219524 




pro" 


"328 


107.553696 




leu" 


"329 


86.789825 




pro" 


"330 


34.287163 




SER" 


"331 


75.764053 


55 


LEU 


"332 


32.840569 




ALA" 


"333 


61.516434 




PRO' 


"334 


82.389992 
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ALA_335 

PRO_33 6 

HEMJ337 

CA_338 

CA_339 

NAG_340 

NAG 341 



10 



15 



20 



25 



6.246871 
56.750813 
60.435017 
2.078997 
0.000000 
141.534668 
186.311371 
Subset REST: 

restmole. list 
Subset REST: 

ARP: 9 ,69-70, 125, 127 ,133 ,299-301, 334-336 

restatom. list 
Subset REST: 

ARP: SER 9 : N, CA, C, O, CB,OG 
ARP : GLY 69:N,CA,C,0 

ARP : GLN 70:N,CA,C,0,CB,CG,CD,OE1,NE2 
ARP : GLY 125:N,CA,C,0 
ARP : SER 127:N r CA,C,0,CB,OG 
ARP:PRO 133:N,CA,CD,C,0,CB,CG 
ARP: SER 299:N,CA,C,0,CB,OG 
ARP: ALA 300:N,CA,C,O,CB 
ARP: VAL 301:N,CA,C,O,CB,CGl,CG2 
ARP : SER 334:N,CA,C,0,CB,OG 
ARP : GLY 335:N,CA,C,0 
ARP:PRO 336:N,CA,CD,C,0,CB,CG 
Subset SUB5B: 

sub5bmole. list 
Subset SUB5B: 

ARP: 10-11,34, 38,65-68,71-72,120-121, 123-124, 

30 128-132,134,270,274, 

ARP: 297-298; 302-303,311-312,332-333, 337-338 
sub5batom. list 
Subset SUB5B: 

N,CA,C,0,CB,CG1,CG2 
N,CA,C,0,CB,0G1,CG2 
N,CA,C,0,CB,CG,CD,0E1,NE2 
N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
N,CA,C,0,CB,CG,CD1,CD2 
N,CA,C,0,CB,0G1,CG2 
N,CA,C,0,CB 
N,CA,C,0,CB 

N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
N,CA,C,0 

N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
N,CA,C,0,CB 
N,CA,C,0,CB 
N,CA,C,0,CB,CG1,CG2 
N,CA,C,0,CB,CG,0D1,ND2 
N,CA,C,0,CB,SG 
N,CA,CD,C,0,CB,CG 
N,CA,C,0 
N,CA,C,0,CB,OG 

N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
N CA C O 

N^CAic'o,CB,CG,CD,NE,CZ,NHl,NH2 
N , CA , C , O , CB , CG 1 , CG2 , CD 1 
N,CA,CD,C,0,CB,CG 



35 



40 



45 



50 



55 



ARP: VAL 10 
ARP : THR 11 
ARP: GLN 34 
ARP : TYR 38 
ARP: LEU 65 
ARP: THR 66 
ARP: ALA 67 
ARP: ALA 68 
ARP : PHE 71 
ARP: GLY 72 
ARP: PHE 120 
ARP: ALA 121 
ARP: ALA 123 
ARP: VAL 124 
ARP: ASN 128 
ARP:CYS 129 
ARP:PRO 130 
ARP : GLY 131 
ARP: SER 132 
ARP: ARG 134 
ARP: GLY 270 
ARP: ARG 274 
ARP: ILE 297 
ARP: PRO 298 



- WO 98/35026 



PCT/DK98/00046 



83 



ARP: SER 302 :N, CA, C,0 , CB, OG 
ARP: ASN 303 :N, CA, C,0, CB, CG,ODl ,ND2 
ARP : GLY 311:N,CA,C,0 
ARP:GLY 312:N,CA,C,0 
5 ARPtTHR 332:N,CA,C,0,CB,0G1,CG2 

ARP: ALA 333:N,CA,C,0,CB 
ARP: LEU 337:N,CA,C, 0,CB,CG,CD1,CD2 
ARP:PRO 338:N,CA,CD,C,0,CB,CG 
Subset ACTSITE: 
10 actsitemole. list 
Subset ACTSITE: 

ARP: 44-61 , 75-77 , 79-80 , 87-88 ,90-96 , 

99 , 118 , 122 , 126 , 135, 148-149 , 152-158 , 
ARP : 163-164 , 167 , 176-194 , 197-205 , 207-209 , 2 11- 
15 213,216,230-231,241, 

ARP: 243-246, 249, 259, 273, 277, 280, 343-347H 

actsiteatom. list 
Subset ACTSITE: 

ARP : GLU 44:N,CA,C,0,CB,CG,CD,0E1,0E2 

20 ARP : SER 45:N,CA,C,0,CB,0G 

ARP:PRO 46:N,CA,CD,C,0,CB,CG 

ARP : VAL 47:N,CA,C,0,CB,CG1,CG2 

ARP : ARG 48:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

ARP:LYS 49 : N, CA, C, O, CB, CG, CD, CE,NZ 

25 ARP: ILE 50:N,CA,C,0,CB,CG1,CG2,CD1 

ARP: LEU 51:N,CA,C,0,CB,CG,CD1,CD2 
ARP: ARG 52:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
ARP: ILE 53:N,CA,C,0,CB,CG1,CG2,CD1 
ARP :VAL 54:N,CA,C,0,CB,CG1,CG2 

30 ARP : PHE 55:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

ARP:HIS 56:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
ARP:ASP 57:N,CA,C,0,CB,CG,OD1,OD2 
ARP: ALA 58:N,CA,C,0,CB 
ARP: ILE 59:N,CA,C,0,CB,CG1,CG2,CD1 

35 ARP: GLY 60:N,CA,C,0 

ARP : PHE 61:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

ARP : GLY 75:N,CA,C,0 

ARP: ALA 76:N,CA,C,0,CB 

ARP:ASP 77:N,CA,C,0,CB,CG,0D1,0D2 

40 ARP : SER 79:N,CA,C,0,CB,OG 

ARP: ILE 80:N,CA,C,0,CB,CG1,CG2,CD1 

ARP: GLU 87:N,CA,C,0,CB,CG,CD,OE1,OE2 

ARP : LEU 88:N,CA,C,0,CB,CG,CD1,CD2 

ARP: PHE 90:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

45 ARP:PRO 91:N,CA,CD,C,0,CB,CG 

ARP: ALA 92:N,CA,C,0,CB 
ARP: ASN 93:N,CA,C,0,CB,CG,0D1,ND2 
ARP: GLY 94:N,CA,C,0 
ARP : GLY 95:N,CA,C,0 

50 ARP : LEU 96:N,CA,C,0,CB,CG,CD1,CD2 

ARP:THR 99:N,CA,C,0,CB,0G1,CG2 
ARP: ILE 118:N,CA,C,0,CB,CG1,CG2,CD1 
ARP:THR 122:N,CA,C,0,CB,0G1,CG2 
ARP: MET 126:N,CA,C,0,CB,CG,SD,CE 

55 ARP: LEU 135:N,CA,C,0,CB,CG,CD1,CD2 

ARP : SER 148:N,CA,C,0,CB,0G 
ARP:PR0 149:N,CA,CD,C,0,CB,CG 
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ARP: 


LEU 


152: 


N, 


CA, 


c, 


o, 




ARP: 


ILE 


153: 


N, 


CA, 


c, 






ARP: 


PRO 


154: 


N, 


CA, 


CD,C 




ARP: 


GLY 


155: 


N, 


CA, 


c, 


0 


5 


ARP: 


PRO 


156: 


N, 


CA, 


CD,C 




ARP: 


GLY 


157: 


N, 


CA, 


c, 


0 




ARP: 


ASN 


158: 


N, 


CA, 


c, 


o, 




ARP: 


ILE 


163: 


N, 


CA, 


c, 






ARP: 


LEU 


164: 


N, 


CA, 


c, 


o, 


10 


ARP: 


MET 


167: 


N, 


CA, 


c, 


o, 




ARP: 


GLU 


176: 


N, 


CA, 


c, 


o, 




ARP: 


VAL 


177: 


N, 


CA, 


c, 






ARP: 


VAL 


178: 


N, 


CA, 


c, 


o, 




ARP: 


ASP 


179: 


N, 


CA, 


c, 


o, 


15 


ARP: 


LEU 


180: 


N, 


CA, 


C, 


o, 




ARP: 


LEU 


181: 


N, 


CA, 


c, 


o, 




ARP: 


ALA 


182: 


N, 


CA, 


c, 


o, 




ARP: 


ALA 


183: 


N, 


CA, 


c, 


o, 




ARP: 


HIS 


184: 


N, 


CA, 


c, 




20 


ARP: 


SER 


185: 


N, 


CA, 


c, 


o, 




ARP: 


LEU 


186: 


N, 


CA, 


c, 


o, 




ARP: 


ALA 


187: 


N, 


CA, 


c, 


o, 




ARP: 


SER 


188: 


N, 


CA, 


c, 


o, 




ARP: 


GLN 


189: 


N, 


CA, 


c, 




25 


ARP: 


GLU 


190: 


N, 


CA, 


c, 






ARP: 


:GLY 


191: 


N, 


CA, 


c, 


0 




ARP: 


:LEU 


192: 


N, 


CA # 


c, 


o, 




ARP: 


'ASN 


193: 


N, 


CA, 


c, 


o, 




ARP: 


:SER 


194: 


>N, 


CA, 


c, 


o, 


30 


ARP 


:PHE 


197: 


:N, 


CA, 


c, 


o, 




ARP' 


: ARG 


198: 




CA, 


c, 






ARP: 


:SER 


199: 




CA, 


c, 


o, 




ARP 


:PRO 


200: 


:N, 


,CA, 


CD,C 




ARP 


:LEU 


201* 


:N, 


,CA, 


c, 


o, 


35 


ARP 


:ASP 


202 


:N, 


,CA, 




o, 




ARP 


:SER 


203: 


;N, 


,CA, 


c, 


o, 




ARP 


:THR 


204, 


:N, 


,CA, 


c, 


o, 




ARP 


:PRO 


205 


:N, 


,CA, 


,CD,C 




ARP 


:VAL 


207 


:N, 


rCA, 






40 


ARP 


:PHE 


208 


:N 


,CA, 




o, 




ARP 


:ASP 


209 


:N 


rCA, 




o, 




ARP 


: GLN 


211 


:N 


,CA, 




o, 




ARP 


:PHE 


212 


:N 


,CA 




( o r 




ARP 


:TYR 


213 


:N 


r CA 






45 


ARP 


: THR 


216 


:N 


f CA 


f c 


r o, 




ARP 


:PHE 


230 


:N 


r CA 




f 0, 




ARP 


: ALA 


231 


:N 


,CA 




,0, 




ARP 


:PHE 


241 


:N 


,CA 


F c 


,0, 




ARP 


:MET 


243 


:N 


f CA 




rO, 


50 


ARP 


: ARG 


244 


:N 


r CA 


,c 


rO, 




ARP 


:SER 


245 


:N 


f CA 


f c 


rO, 




ARP 


:ASP 


246 


:N 


, CA 


f c 


r 0, 




ARP 


:LEU 


249 


:N 


, CA 


r c 


rO, 




ARP 


:TRP 


259 


:N 


,CA 


f c 


r 0, 


55 






CD2 ( 


NE1,CE2 




ARP 


:TYR 


273 


:N 


,CA 


fC 


rO, 




ARP: MET 


277 


:N 


,CA 


,c 


,0, 



,0,CB,CG1,CG2 
,0,CB,CG,0D1,0D2 
,0,CB,CG,CD1,CD2 
,0,CB,CG,CD1,CD2 
,0,CB 



,0E2 



CB 

CB,CG 



CB 



CB,CG, 



),0E1,NE2 
),0E1,0E2 

D1,CD2 



CB,0G 
,0,CB,CG 
CB,CG,CD1,C 
CB,CG,0D1,C 
CB,0G 
CB,0G1,CG2 

:,o,cb,cg 

CB,CG1,CG2 
CB.CG.CD1.C 



CE1,CE2,CZ 
,NH1,NH2 



CB 
CB 



CB 
CB 
CB 
CB 



B,CG,ODl,OD2 
B,CG,CD,0E1,NE2 
B , CG , CD1 , CD 2 , CE1 , CE2 , CZ 
B , CG , CD1 , CD 2 , CE1 , CE2 , CZ , 
B,0G1,CG2 

B , CG, CD1 , CD 2 , CE1 , CE2 , CZ 

>,CG,CD1,CD2,CE1,CE2,CZ 
*,CG,SD,CE 

t r CG,CD,NE,CZ,NHl,NH2 
,0G 



,CD1, 



,CE2,CZ,0H 
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ARP:MET 280:N,CA,C,O, CB, CG,SD,CE 
ARP: ALA 343:N,CA,C,0,CB 
ARP:PRO 344:N,CA,CD,C,0,OXT, CB f CG 
ARP: HEM 345H: FE,NA, NB,NC,ND, CHA, CHB, 
5 CHC , CHD , CIA , C2 A , C3 A , C4 A , CMA , CAA , CBA , CGA 

ARP: HEM 345H:01A, 02A, C1B, C2B, C3B, C4B, CMB, 

CAB, CBB ,010,020, C3C, C4C, CMC, CAC, CBC 
ARP: HEM 345H:C1D,C2D,C3D,C4D,CMD,CAD,CBD,CGD,01D,02D 

ARP:CA 346H:CA 
10 ARP:CA 347H:CA 

Subset RESTx: 

restxmole. list 
Subset RESTX 

NEWMODEL: 9 ,334-336 
15 restxatom. list 
Subset RESTX: 

NEWMODEL: SER 9:N,CA,C,0,CB,OG 
NEWMODEL: SER 334 :N, CA, C, O, CB,OG 
NEWMODEL: GLY 335:N,CA,C,0 
20 NEWMODEL: PRO 336:N,CA,CD,C,0,CB,CG 



Example 5 

Activation of mPEG 15,000 with N-succinimidvl carbonate 

25 mPEG 15,000 was suspended in toluene (4 ml/g of mPEG) 20% was 

distilled off at normal pressure to dry the reactants 
azeotropically. Dichloromethane (dry 1 ml/g mPEG) was added when 
the solution was cooled to 30°C and phosgene in toluene (1,93 M 5 
mole/mole mPEG) was added and mixture stirred at room temperature 

30 over night. The mixture was evaporated to dryness and the desired 
product was obtained as waxy lumps. 

After evaporation dichloromethane and toluene (1:2, dry 3 
ml/g mPEG) was added to re-dissolve the white solid. N-Hydroxy 
succinimide (2 mole/mole mPEG.) was added as a solid and then 

3 5 triethylamine (1.1 mole/mole mPEG) . The mixture was stirred for 3 
hours, initially unclear, then clear and ending with a small 
precipitate. The mixture was evaporated to dryness and 
recrystallised from ethyl acetate (10 ml) with warm filtration to 
remove salts and insoluble traces. The blank liquid was left for 

40 slow cooling at ambient temperature for 16 hours and then in the 
refrigerator over night. The white precipitate was filtered and 
washed with a little cold ethyl acetate and dried to yield 98 % 
(w/w) . NMR Indicating 80 - 90% activation and 5 o/oo (w/w) 
HNEt 3 Cl. 1 H-NMR for mPEG 15,000 (CDC1 3 ) d 1.42 t (1= 4.8 CH 3 i 

45 HNEt 3 Cl) , 2.84 s (1= 3.7 succinimide), 3.10 dq (1= 3.4 CH 2 i 
HNEt 3 Cl), 3.38 s (1= 2.7 CH 3 i OMe) , 3.40* dd (I = 4.5 0/00, 13 C 
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satellite), 3.64 bs (I = 1364 main peak), 3.89* dd (I = 4.8 o/oo , 
13 C satellite), 4.47 dd (I = 1.8, CH 2 in PEG) . No change was seen 
after storage in a desiccator at 22 °C for 4 months. 

5 Example 6 

Activation of mPEG 5.000 with N-succi nimidvl carbonate 

Activation of mPEG 5,000 with N-succinimidyl carbonate was 
performed as described in Example 5. 

10 EXAMPLE 7 

Construction and expression of PD4 98 variants: 

PD498 site-directed variants were constructed using the "maxi- 

oligonucleotide-PCR M method described by Sarkar et al., (1990): 

BioTechniques 8: 404-407. 
15 The template plasmid was shuttle vector pPD498 or an analogue 

of this containing a variant of the PD498 protease gene. 

The following PD498 variants were constructed, expressed and 

purified. 

A: R28K 
20 B: R62K 

C: R169K 

D: R28K + R62K 

E: R28K + R169K 

F: R62K + R169K 
25 G: R28K+R69K+R169K 

Construction of variants 

For introduction of the R28K substitution a synthetic 

oligonucleotide having the sequence: GGG ATG TAA CCA AGG GAA GCA 
30 GCA CTC AAA CG (SEQ ID NO. 7) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 

prepared by Bst E II and Bgl II digestion. Positive variants were 

recognized by Styl digestion and verified by DNA sequencing of the 

total 769 bp insert. 
35 For introduction of the R62K substitution a synthetic 

oligonucleotide having the sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
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prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by Clal digestion and verified by DNA sequencing of the 
total 769 bp insert. 

For introduction of the R169K substitution a synthetic 
5 oligonucleotide having the sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by the absence of a Rsa I restriction site and verified 
10 by DNA sequencing of the total 769 bp insert. 

For simultaneously introduction of the R28K and the R62K 
substitutions, synthetic oligonucleotides having the sequence: 
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) and the 
sequence: 

15 CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and verified 
by DNA sequencing of the total 769 bp insert. 

2 0 For simultaneously introduction of the R28K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: GGG 
ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 8) and the 
sequence : 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 8) were used 
25 simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert. 

30 For simultaneously introduction of the R62K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: CGA 
CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the sequence: 
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 

35 pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Clal digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert 
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For simultaneously introduction of the R28K, the R62K and the 
R169K substitutions, synthetic oligonucleotides having the 
sequence : 

GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID No. 7), the 
5 sequence : 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the 
sequence : 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
10 pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and absence of 
a Rsa I site. The variant was verified by DNA sequencing of the 
total 769 bp insert. 

15 Fermentation, expression and purification of PD4 98 variants 

Vectors hosting the above mentioned PD498 variants were 
purified from E. coli cultures and transformed into B . subtilis in 
which organism the variants were fermented, expressed and purified 
as described in the "Materials and Methods" section above. 

20 

Example 7 

Conjugation of triple substitited PD498 variant wi th activated 
mPEG 5,000 

200 mg of triple substituted PD498 variant (i.e. the 
25 R28K+R62K+R169K substituted variant) was incubated in 50 mm 
NaBorate, pH 10, with 1.8 g of activated mPEG 5,000 with N- 
succinimidyl carbonate (prepared according to Example 2) , in a 
final volume of 20 ml. The reaction was carried out at ambient 
temperature using magnetic stirring. Reaction time was 1 hour. The 
30 reaction was stopped by adding DMG buffer to a final concentration 
of 5 mM dimethyl glutarate, 1 mM CaCl 2 and 50 mM borate, pH 5.0. 

The molecule weight of the obtained derivative was approxi- 
mately 120 kDa, corresponding to about 16 moles of mPEG attached 
per mole enzyme. 

35 Compared to the parent enzyme, residual activity was close to 

100% towards peptide substrate (succinyl-Ala-Ala-Pro-Phe-p- 
Nitroanilide) . 
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Example 8 

Alleraenicitv trails of PD498 variant-SPEG5 , 000 in guinea pigs 

Dunkin Hartley guinea pigs are stimulated with 1.0 ^g PD498- 
SPEG 5,000 and 1.0 ng modified variant PD498-SPEG 5,000 by 
5 intratracheal installation. 

Sera from immunized Dunkin Hartley guinea pigs are tested 
during the trail period in a specific IgGi ELISA (described above) 
to elucidate whether the molecules could activate the immune 
response system giving rise to a specific IgGx response indicating 
10 an allergenic response. 

The IgGi levels of Dunkin Hartley guinea pigs during the trail 
period of 10 weeks are observed. 

Example 9 

15 Suitable substitutions in Humicola lanuginosa lipase for 

addition of amino attachment groups f-NHo ) 

The 3D structure of Humicola lanuginosa lipase (SEQ ID NO 6) 

is available in Brookhaven Databank as ltib.pdb. The lipase 

consists of 269 amino acids. 
20 The procedure described in Example 1 was followed. The 

sequence of H. lanuginosa lipase is shown below in the table 

listing solvent accessibility data for H. lanuginosa lipase. 

H. lanuginosa residue numbering is used (1-269) , and the active 

site residues (functional site) are S146, S201 and H258. The 
25 synonym TIB is used for H . lanuginosa lipase. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeKzone.bcl and makeKzone2 .bcl below: 



Conservative substitutions: 

3 0 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 255,0,255 

3 Zone Subset LYS :lys:NZ Static monomer/residue 10 
Color_Subset 255,255,0 

35 4 Zone Subset NTERM :1:N Static monomer /residue 10 
Color_Subset 255,255,0 

5 #N0TE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 146, 201, 258 Static monomer/residue 8 
40 ColorjSubset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #N0TE: editnextline object name according to the protein 
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10 Combine Subset REST Difference TIB ALLZONE 

11 List Subset REST Atom Output_File restatom. list 

12 List Subset REST monomer/ residue Output JFile restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 14 List Subset ACTSITE Atom Output File actsiteatom. list 

15 List Subset ACTSITE monomer/ residue Output_File 
actsitemole. list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
10 Color_Subset 

18 Combine Subset SUB 5 A Difference REST5A ACTSITE 

19 Combine Subset SUB5B Difference SUB5A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

15 21 List Subset SUB5B Atom Output File sub5batom. list 

22 List Subset SUB5B monomer /residue Output_File sub5bmole. list 

23 #Now identify sites for lys->arg substitutions and continue 
with make zone 2 .be 1 

24 #Use grep command to identify ARG in restatom- list, 
20 sub5batom. list & accsiteatom. list 

Comments : 

In this case of E. lanuginosa (=TIB) , REST contains the 
Arginines Argl3 3, Argl39, Argl60, Argl79 and Arg 209, and SUB5B 
25 contains Argll8 and R125. 

These residues are all solvent exposed. The substitutions 
R133K, R139K, R160K, R179K, R209K, R118K and R125K are 
identified in TIB as sites for mutagenesis within the scope of 
this invention. The residues are substituted below in section 
30 2, and further analysis done. The subset ACTSITE contains no 
lysines. 



Non-conservative substitutions: 
makeKzone2 . bcl 

35 1 #sourcefile makezone2 .bcl Claus von der Osten 961128 

2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #N0TE: editnextline object name according to protein 
40 5 Copy Object -To_Clipboard -Displace TIB newmodel 

6 Biopolymer 

7 #N0TE: editnextline object name according to protein 

8 Blank Object On TIB 

9 #NOTE: editnextlines with lys->arg positions 
45 10 Replace Residue newmodel: 118 lys L 

11 Replace Residue newmodel: 125 lys L 

12 Replace Residue newmodel: 133 lys L 

13 Replace Residue newmodel: 139 lys L 

14 Replace Residue newmodel: 160 lys L 
50 15 Replace Residue newmodel: 179 lys L 

16 Replace Residue newmodel: 209 lys L 
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17 # 

18 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

19 Color Molecule Atoms newmodel Specified Specification 
5 255,0,255 

20 Zone Subset LYSx newmodel: lys:NZ Static monomer/residue 10 
Color_Subset 255,255,0 

21 Zone Subset NTERMx newmodel: 1:N Static monomer/ residue 10 
Color_Subset 255,255,0 

10 22 #N0TE: editnextline ACTSITEx residues according to the 
protein 

23 Zone Subset ACTSITEx newmodel: 146, 201, 258 Static 
monomer /residue 8 Color_Subset 255,255,0 

24 Combine Subset ALLZONEx Union LYSx NTERMx 

15 25 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

26 Combine Subset RESTx Difference newmodel ALLZONEx 

27 List Subset RESTx Atom Output_File restxatom. list 

28 List Subset RESTx monomer/residue OutputJFile 
restxmole. list 

20 29 # 

30 Color Molecule Atoms ACTSITEx Specified Specification 
255,0,0 

31 List Subset ACTSITEx Atom OutputJFile act sitexatom. list 

32 List Subset ACTSITEx monomer /residue OutputJFile 
25 actsitexmole.list 

33 # 

34 #read restxatom. list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

3 0 Comments : 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 18,31-33,36,38,40,48,50,56-62,64,78,88,91-93,104- 
106,120,136,225,227-229,250,262,268. Of these three are 
Cysteines involved in disulfide bridge formation, and 
35 consequently for structural reasons excluded from the residues 
to be mutated. The following mutations are proposed in H. 
lanuginosa lipase (TIB) : 

A18K,G31K,T32K,N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K, 
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K, 
40 V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 
Relevant data for Example 2 : 

# TIBNOH20 

# residue area 
GLU_1 110.792610 

45 VAL_2 18.002457 

SER_3 53.019516 

GLN_4 85.770164 

ASP_5 107.565826 

LEU_6 33.022659 

50 PHE_7 34.392754 

ASN 8 84.855331 
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GLN 


9 


39.175591 




PHE~ 


10 


2.149547 




ASN" 


11 


40.544380 




LEU 


12 


27.648788 


5 


PHE 


13 


2.418241 




ALA 


14 


4.625293 




GLN 


15 


28.202387 




TYR~ 


16 


0.969180 




SER~ 


17 


0.000000 


10 


ALA~ 


18 


7.008336 




ALA 


19 


0.000000 




ALA 


2 0 


0.000000 




TYR 


21 


6.947358 




CYS 


22 


8.060802 


15 


GLY~ 


23 


32.147034 




LYS 


"24 


168.890747 




ASN 


25 


8.014721 




ASN 


26 


11.815564 




ASP 


"27 


92.263428 


20 


ALA 


"28 


18.206699 




pro" 


29 


83.188431 




ALA 


"30 


69.428421 




gly" 


'31 


50.693439 




thr"" 


"32 


52.171135 


25 


ASN" 


"33 


111.230743 




ILE" 


"34 


2.801945 




thr" 


"35 


82.130569 




CYS" 


"36 


17.269245 




thr" 


"37 


96.731941 


30 


gly" 


"38 


77.870995 




asn" 


"39 


123.051003 




ala" 


"40 


27.985256 




CYS" 


"41 


0.752820 




pro" 


"42 


46.258949 


35 


GLU" 


"43 


69.773987 




VAL" 


"44 


0.735684 




GLU" 


'45 


77.169510 




LYS" 


"46 


141.213562 




ALA" 


"47 


10.249716 


40 


ASP" 


"48 


109.913902 




ALA" 


"49 


2.602721 




thr" 


"50 


32.012184 




phe" 


"51 


8.255627 




leu" 


"52 


60.093613 


45 


tyr' 


"53 


77.877937 




ser" 


"54 


26.980494 




phe" 


"55 


10.747735 




GLU' 


56 


112.689758 




asp" 


"57 


92.064278 


50 


ser" 


"58 


32.990780 




gly" 


"59 


53.371807 




VAL 


"60 


83.563644 




gly" 


"61 


69.625633 




asp" 


"62 


75.520988 


55 


val" 


"63 


4.030401 




thr" 


"64 


8.652839 




gly" 


"65 


0.000000 
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PHE 


66 


0.268693 




LEU 


*67 


11.822510 




ALA 


"68 


0.537387 




LEU" 


"69 


30.243870 


5 


ASP 


"70 


0.000000 




ASN~ 


"71 


84.101044 




THR 


12 


89.271126 




ASN" 


73 


70.742401 




LYS~ 


74 


98.319168 


10 


LEU" 


75 


8.329495 




ILE 


"76 


5.197878 




VAL 


"77 


0.806080 




LEU 


"78 


5.293978 




SER~ 


"79 


0.000000 


15 


phe" 


80 


2.079151 




arg" 


"81 


41.085312 




gly" 


"82 


1.471369 




ser" 


"83 


43.794014 




arg" 


"84 


100.261627 


20 


ser" 


"85 


70.607552 




ile" 


"86 


59.696865 




glu" 


"87 


136.510773 




asn" 


"88 


119.376373 




TRP 


"89 


102.851227 


25 


ile" 


"90 


78.068588 




GLY 


"91 


60.783607 




ASN" 


"92 


45.769428 




leu" 


"93 


134.228363 




asn" 


"94 


101.810959 


30 


phe" 


"95 


41.212212 




ASP 96 


79.645950 




LEU 


97 


25.281572 




LYS" 


"98 


88.840263 




GLU" 


"99 


132.377090 


35 


ILE" 


"100 


9.135575 




asn" 


"101 


63.444527 




asp" 


"102 


88.652847 




ile" 


"103 


33.470661 




CYS" 


"104 


11.553816 


40 


ser" 


"105 


99.461174 




gly" 


"106 


40.325161 




CYS" 


107 


4.433561 




arg" 


"108 


97.450104 




gly" 


"109 


1.343467 


45 


his" 


"no 


4.652464 




asp" 


"ill 


37.023655 




gly" 


"112 


29.930408 




phe" 


"113 


14.976435 




thr" 


114 


10.430954 


50 


ser" 


"115 


40.606895 




ser' 


116 


13.462922 




TRP" 


"117 


10.747735 




arg" 


"118 


114.364281 




ser" 


"119 


46.880249 


55 


VAL 


120 


13.434669 




ALA 121 


18.258261 




ASP 122 


110.753098 
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THR_123 
LEU_124 
ARGJL25 
GLN_12 6 
5 LYS_127 
VAL_128 
GLU_129 
ASP_130 
ALA_131 

10 VAL_132 
ARG_133 
GLU_134 
HISJL35 
PRO_136 

15 ASP_137 
TYR_138 
ARGJL39 
VALJL40 
VALJL41 

20 PHEJL42 
THRJL43 
GLY_144 
HIS_145 
SERJL46 

25 LEU_147 
GLY_148 
GLY_149 
ALA_150 
LEU_151 

30 ALAJL52 
THR_153 
VAL_154 
ALA_155 
GLY_156 

35 ALA__157 
ASP_158 
LEU_159 
ARG_160 
GLY_161 

40 ASNJL62 
GLY_163 
TYR_164 
ASP_165 
ILE_166 

45 ASPJL67 
VAL_168 
PHE_169 
SERJL70 
TYR_171 

50 GLY_172 
ALA_173 
PRO_174 
ARG__175 
VAL_176 

55 GLY_177 
ASN_178 
ARG 179 



69.641922 

17.090784 

73.929977 

101.320190 

84.450241 

6.448641 

47.700993 

75.529091 

11.340775 

27.896025 

153.136490 

132.140594 

54.553406 

97.386963 

22.653191 

35.392658 

74.321243 

10.173222 

0.233495 

3.224321 

0.000000 

0.000000 

4.514527 

15.749787 

40.709171 

0.000000 

0.000000 

0.537387 

22.838938 

0.268693 

18.078798 

7.254722 

0.000000 

0.000000 

15.140230 

41.645477 

6.144750 

41.939716 

68.978180 

68.243805 

79.181274 

36.190247 

103.068283 

0.000000 

24.326443 

4.299094 

0.466991 

3.339332 

0.000000 

0.000000 

12.674671 

13.117888 

10.004488 

21.422220 

2.680759 

21.018063 

110.282166 
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ALAJL80 
PHE_181 
ALA_182 
GLU_183 
5 PHE_184 
LEU_185 
THR_186 
VAL_187 
GLN_188 

10 THR_189 
GLY_190 
GLY_191 
THR_192 
LEU_193 

15 TYR_194 
ARG_195 
ILE_196 
THR_197 
HIS_198 

20 THR_199 
ASN_200 
ASP_201 
ILE_202 
VAL_203 

25 PRO_204 
ARG_205 
LEU_206 
PRO_207 
PRO_208 

30 ARG_209 
GLU_210 
PHE_211 
GLY_212 
TYR_213 

35 SER_214 
HIS_215 
SER_216 
SER_217 
PRO_218 

40 GLU_219 
TYR_220 
TRP_221 
ILE_222 
LYS_223 

45 SER_224 
GLY_225 
THR_226 
LEU_227 
VAL_228 

50 PRO_229 
VAL_230 
THR_231 
ARG_232 
ASN_233 

55 ASP_234 
ILE_235 
VAL 236 



33.210381 

4.567788 

3.897251 

76.354004 

71.225983 

24.985012 

47.023815 

98.244606 

54.152954 

88.660645 

24.792120 

10.726818 

45.458744 

16.633211 

34.829491 

29.030851 

1.973557 

3.493014 

1.532270 

34.785877 

39.789238 

0.000000 

31.168434 

29.521076 

3.515322 

44.882454 

51.051746 

12.575329 

43.259636 

113.700233 

154.628540 

112.505188 

30.084938 

3.268936 

12.471436 

23.354481 

16.406200 

14.665598 

17.240993 

13.145291 

18.718306 

39.229233 

5.105175 

120.739983 

15.407301 

29.306646 

66.806862 

122.682808 

60.923004 

104. 620377 

23.398251 

63.372971 

80.357857 

89.255066 

43.011250 

2.114349 

45.140491 
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LYS 


237 


105.651306 




ILE" 


"238 


24.671705 




GLU" 


"239 


116.891907 




GLY" 


"240 


31.965794 


5 


ile" 


"241 


46.278099 




asp" 


"242 


28.963699 




ALA" 


"243 


25.158146 




thr" 


"244 


98.351440 




gly" 


"245 


43.842186 


10 


gly" 


"24 6 


0.700486 




ASN 


"247 


3.926274 




asn" 


"248 


51.047890 




gln" 


"249 


66.699188 




pro" 


"250 


132.414047 


15 


asn" 


"251 


70.213730 




ile" 


252 


141.498062 




pro" 


"253 


59.089233 




asp" 


"254 


59.010895 




ile" 


"255 


63.298943 


20 


pro" 


"256 


78.608688 




ala" 


"257 


0.806080 




his" 


'258 


3.761708 




leu" 


259 


50.747856 




TRP" 


260 


35.229710 


25 


tyr" 


"261 


5.440791 




phe" 


'262 


36.457939 




gly" 


"263 


22.071375 




LEU 264 


109.148178 




ILE 


265 


2.418241 


30 


gly" 


"266 


17.730062 




thr" 


"267 


68.217873 




CYS" 


"268 


15.418195 




leu" 


"269 


165.990997 



Subset REST: 
35 restmole.list 
Subset REST: 

TIB: 5, 8-9, 13-14, 16, 18-20, 31-34, 36, 38, 40, 48-50, 56- 
66,68,76-79,88,91-93, 

TIB: 100-107, 116-117, 119-121,132-134, 13 6, 139-142, 154- 
40 169,177-185, 

TIB: 187, 189-191, 207-212, 214-216, 225, 227-229, 241- 

244,250,262,268 
restatom. list 
Subset REST: 
45 TIB:ASP 5:N,CA,C,0,CB,CG,ODl,OD2 

TIB: ASN 8:N,CA,C,0,CB,CG,0D1,ND2 

TIB: GLN 9:N,CA,C,0,CB,CG,CD,0E1,NE2 

TIB: PHE 13:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

TIB: ALA 14 :N, CA, C,0, CB 
50 TIB: TYR 16 :N, CA, C,0, CB, CG, CD1, CD2 , CE1, CE2 , CZ , OH 

TIB: ALA 18:N,CA,C,0,CB 

TIB: ALA 19:N,CA,C,0,CB 

TIB: ALA 20 :N, CA, C, O, CB 

TIB: GLY 31:N,CA,C,0 
55 TIB: THR 32 :N, CA, C, O, CB, OG1, CG2 

TIB: ASN 33 :N, CA, C,0, CB, CG, 0D1,ND2 

TIB : ILE 34 : N, CA, C, O, CB, CGI , CG2 , CDl 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 



CYS 
GLY 
ALA 
ASP 
ALA 
THR 
GLU 
ASP 
SER 
GLY 
VAL 
GLY 
ASP 
VAL 
THR 
GLY 
PHE 
ALA 
ILE 
VAL 
LEU 
SER 
ASN 
GLY 
ASN 
LEU 
ILE 
ASN 
ASP 
ILE 
CYS 
SER 
GLY 
CYS 
SER 
TRP 



CE3,CZ2 
TIB: SER 
TIB: VAL 
TIB: ALA 
TIB: VAL 
TIB:ARG 
TIB: GLU 
TIB : PRO 
TIB : ARG 
TIB: VAL 
TIB: VAL 
TIB: PHE 
TIB: VAL 
TIB: ALA 
TIB: GLY 
TIB: ALA 
TIB: ASP 
TIB: LEU 
TIB: ARG 
TIB: GLY 
TIB: ASN 



36:N,CA, C,0,CB, SG 

38:N,CA,C,0 

40:N,CA, C,0,CB 

48 :N,CA,C, 0,CB,CG,0D1,0D2 

49:N,CA,C,0,CB 

50:N,CA,C,O,CB,OGl,CG2 

56:N,CA,C,0,CB,CG,CD,OEl,OE2 

57:N,CA,C,0,CB,CG,ODl,OD2 

58:N,CA,C,0,CB,OG 

59:N,CA,C,0 

60:N,CA,C,0,CB,CG1,CG2 
61:N,CA,C,0 

62:N,CA,C,0,CB,CG,0D1,0D2 
63:N,CA,C,0,CB,CG1,CG2 
64:N,CA,C,0,CB,0G1,CG2 
65:N,CA,C,0 

66:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

68:N,CA,C,0,CB 

76:N,CA,C,0,CB,CG1,CG2,CD1 

77:N,CA,C,0,CB,CG1,CG2 

78:N,CA,C,0,CB,CG,CD1,CD2 

79:N,CA,C,0,CB,OG 

88:N,CA,C,0,CB,CG,ODl,ND2 

91:N,CA,C,0 

92:N,CA,C,0,CB,CG,0D1,ND2 
93:N,CA,C,0,CB,CG,CD1,CD2 
100:N,CA,C,0,CB,CG1,CG2,CD1 
101:N,CA,C,0,CB,CG,OD1,ND2 
102:N,CA,C,O,CB,CG,ODl,OD2 
103 :N, CA,C,0,CB,CG1,CG2,CD1 
104:N,CA,C,O,CB,SG 
105:N,CA,C,0,CB,OG 
106:N, CA,C,0 
107:N,CA,C,O,CB,SG 
116:N,CA,C,0,CB,0G 

1 17 : N , CA , C , O , CB , CG , CD1 , CD2 , NE1 , CE2 , 

C,0,CB,OG 
C,0,CB,CG1,CG2 
C,0,CB 

C,0,CB,CG1,CG2 
C,0,CB,CG,CD,NE,CZ,NH1,NH2 
C,0,CB,CG,CD,0E1,0E2 



CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 

,CZ3,CH2 
119:N 
120:N 
121:N 
132:N 
133:N 
134:N 
136:N 
139:N 
140:N 
141:N 
142:N 
154:N 
155:N 
156:N 
157:N 
158:N 
159:N 
160:N 
161:N 
162:N 



CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 



CD,C,0,CB,CG 



CB,CG,CD,NE,CZ,NH1,NH2 

CB,CG1,CG2 

CB,CG1,CG2 

CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ 

CB,CG1,CG2 

CB 

CB 

CB,CG,0D1,0D2 
CB,CG,CD1,CD2 
CB,CG,CD,NE,CZ,NH1,NH2 
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TIB: 


, GLY 


163: 


:N, 


CA, 


C,0 






TIB! 


,TYR 


164: 


• N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ,0H 




TIB: 


:ASP 


165: 


'N, 


CA, 


C,0,CB,CG,0D1,0D2 






TIB: 


ILE 


166: 


>N, 


CA, 


C,0,CB,CG1,CG2,CD1 




5 


TIB: 


ASP 


167: 


in, 


CA, 


C,O,CB,CG,0Dl,OD2 






TIB: 


VAL 


168: 




CA, 


C,0,CB,CG1,CG2 






TIB: 


,PHE 


169: 


:N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 




TIB: 


;GLY 


177: 


;N, 


CA, 


c,o 






TIB: 


:ASN 


178: 


:N, 


CA, 


C,0,CB, CG, 0D1,ND2 




10 


TIB: 


ARG 


179: 


:N, 


CA, 


C,O,CB,CG,0D,NE,CZ,NHl 


,NH2 




TIB: 


, ALA 


180: 


in, 


CA, 


,0,0, CB 






TIB: 


;PHE 


181: 


;N, 


CA, 


0,0, CB,CG, CD1,CD2, CE1, 


CE2,CZ 




TIB: 


: ALA 


182: 


:N, 


CA, 


C,O f CB 






TIB: 


;GLU 


183: 


:N, 


CA, 


C,0,CB,CG,CD,0E1,0E2 




15 


TIB: 


:PHE 


184: 


:N, 


CA, 


C,0,CB,CG, CD1,CD2,CE1, 


CE2 , CZ 




TIB: 


:LEU 


185: 


:N, 


CA, 


0 , 0 , CB , CG , CD1 , CD2 






TIB: 


;VAL 


187: 


:N, 


CA, 


C,0,CB,CG1,CG2 






TIB: 


:THR 


189: 


:N, 


CA, 


C,0, CB / 0G1,CG2 






TIB: 


: GLY 


190: 


:N, 


CA, 


0,0 




20 


TIB: 


: GLY 


191; 


;N, 


CA, 


0,0 






TIB: 


:PRO 


207: 


:N, 


CA, 


CD,C,0,CB,CG 






TIB: 


:PRO 


208: 


:N, 


CA, 


CD,C, 0,CB,CG 






TIB: 


: ARG 


209: 


:N, 


CA, 


C,0,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: 


:GLU 


210: 


:N, 


CA, 


,C,O,CB,CG,0D,OEl,OE2 




25 


TIB: 


:PHE 


211: 


:N, 


CA, 


,C,0, CB,CG,CD1, CD2,CE1, 


CE2 , CZ 




TIB: 


: GLY 


212: 


:N, 


CA, 


C,0 






TIB: 


:SER 


214: 


:N, 


CA, 


0,O,CB,0G 






TIB: 


:HIS 


215: 


:N, 


CA, 


C,0,CB,CG,ND1,CD2,CE1, 


NE2 




TIB: 


:SER 


216: 


IN, 


CA, 


,C,O,0B,0G 




30 


TIB: 


:GLY 


225: 


IN, 


,CA, 


,c,o 






TIB, 


:LEU 


227: 


IN, 


,CA, 


,0,0,05, CG,CD1,CD2 






TIB: 


:VAL 


228: 


IN, 


CA, 


,C,0,CB,CG1,CG2 






TIB: 


:PRO 


229: 


iN, 


CA, 


,00,0,0, CB,CG 






TIB« 


:ILE 


241: 


IN, 


CA, 


,0,0,06,061,062,001 




35 


TIB 


:ASP 


242: 


IN, 


,CA, 


,0,0,06,00,001,002 






TIB 


: ALA 


243: 


IN, 


,CA, 


,0,0, CB 






TIB 


:THR 


244: 


IN, 


,CA, 


,C,0,CB,0G1,CG2 






TIB 


:PRO 


250 


IN, 


,CA, 


, CD , C , 0 , CB , CG 






TIB 


:PHE 


262 


:N 


rCA, 


,C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 


40 


TIB 


:CYS 


268 


IN, 


,CA, 


,C,0,CB,SG 





Subset SUB5B: 
sub5raole. list 



Subset SUB5B: 

TIB: 3-4, 6-7, 10-12, 15, 22-23 , 25-30, 35, 37 ,39 ,41-42, 44-47, 51- 
45 55,67,69-70, 

TIB: 72, 74-75, 94-99, 108-112, 114-115, 118 ,122-126, 128- 

131,135,137-138, 

TIB: 186, 188, 192-195, 213, 217-219, 223-224, 230-231, 234-235, 238- 
240, 

50 TIB:245,269 

subSbatom. list 
Subset SUB5B: 

TIB: SER 3 :N,CA,C,0,CB,0G 
TIB:GLN 4 :N,CA,C,0,CB,CG,CD,0E1,NE2 
55 TIB: LEU 6 :N, CA, C, O, CB, CG, CD1 , CD2 

TIB: PHE 7:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
TIB:PHE 10:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 



ASN 
LEU 
GLN 
CYS 
GLY 
ASN 
ASN 
ASP 
ALA 
PRO 
ALA 
THR 
THR 
ASN 
CYS 
PRO 
VAL 
GLU 
LYS 
ALA 
PHE 
LEU 
TYR 
SER 
PHE 
LEU 
LEU 
ASP 
THR 
LYS 
LEU 
ASN 
PHE 
ASP 
LEU 
LYS 
GLU 
ARG 
GLY 
HIS 
ASP 
GLY 
THR 
SER 
ARG 
ASP 
THR 
LEU 
ARG 
GLN 
VAL 
GLU 
ASP 
ALA 
HIS 
ASP 
TYR 



11 
12 
15 
22 
23 
25 
26 
27 
28 
29 
30 
35 
37 
39 
41 
42 
44 
45 
46 
47 
51 
52 
53 
54 
55 
67 
69 
70 
72 
74 
75 
94 
95 
96 
97 
98 
99 





CA, 


C,0, 


CB ,CG,0D1,ND2 






:N, 


CA, 


C,0, 


CB,CG,CD1,CD2 






:N, 


CA, 


C,0, 


CB,CG,CD,OEl,NE2 






:N, 


CA, 


C,0, 


CB,SG 






:N, 


CA, 


C,0 








:N, 


CA, 


c,o, 


CB,CG,0D1,ND2 






:N, 


CA, 


c,o, 


CB,CG,ODl,ND2 






:N, 


CA, 


c,o, 


CB , CG , OD1 , OD2 






:N, 


CA, 


c,o, 


CB 






:N, 


CA, 


CD,C, 0,CB # CG 






:N, 


CA, 


c,o, 


CB 






:N, 


CA, 


C,0, 


CB,0G1,CG2 






:Nj 


CA, 


C,0, 


CB / 0G1 / CG2 






:N, 


CA, 


C f O, 


CB,CG,0D1,ND2 






:N ( 


CA, 


c,o, 


CB,SG 






:N» 


CA, 


CD,C,0,CB,CG 






:N, 


CA, 


C f O, 


CB, CG1,CG2 






:N, 


CA, 


C,0, 


CB,CG,CD,OEl,OE2 






:N, 


CA, 


C,0, 


CB,CG,CD,CE,NZ 






:N, 


CA, 


C,0, 


CB 






:N, 


CA, 


C,0, 


CB,CG,CD1,CD2,CE1, 


CE2 


,CZ 


:N ( 


CA, 


c,o, 


CB,CG,CD1,CD2 






:N, 


CA, 


c,o, 


CB , CG , CD1 , CD2 , CE1 , 


CE2 


,CZ,OH 


:N, 


CA, 


c,o, 


CB,OG 






:N, 


CA, 


0,0, 


CB , CG , CD1 , CD2 , CE1 , 


CE2 


,CZ 


:N, 


CA, 


0,0, 


CB,CG,CD1,CD2 






:N, 


,CA, 


0,0, 


CB,CG,CD1,CD2 






:N, 


,CA, 


0,0, 


CB,CG,ODl,OD2 






:N, 


,CA, 


o,o, 


CB,0G1,CG2 






:N, 


,CA, 


0,0, 


CB,CG,CD,CE,NZ 






:N, 


,CA, 


0,0, 


CB,CG,CD1,CD2 






:N, 


,CA, 


0,0, 


CB,CG,0D1,ND2 






:N, 


,CA, 


0,0, 


CB,CG,CD1,CD2,CE1, 


CE2 


,CZ 


:N, 


,CA, 


0,0, 


CB,CG,ODl,OD2 






:N, 


,CA, 


0,0, 


CB,CG,CD1,CD2 






:N, 


,CA, 


0,0, 


CB,CG,CD,CE,NZ 






:N, 


,CA, 


0,0, 


CB,CG,CD,OEl,OE2 







108:N, 


CA, 


c, 


,0,CB,CG,CD,NE,CZ,NH1,NH2 


109:N, 


CA, 


c. 


0 


110:N, 


CA, 


c, 


0 , CB , CG , ND1 , CD2 , CE1 , NE2 


111:N, 


CA, 




0,CB,CG,ODl,OD2 


112:N, 


CA, 


c, 


0 


114:N, 


CA, 


c, 


,0,CB,0G1,CG2 


115:N, 


CA, 


c. 


,0,CB,OG 


118:N, 


CA, 


c, 


,0,CB,CG,CD,NE,CZ,NH1,NH2 


122:N, 


CA, 


o> 


,0,CB,CG,ODl,OD2 


123:N, 


CA, 


o t 


f O,CB,OGl,CG2 


124:N, 


CA, 


o, 


,0,CB,CG,CD1,CD2 


125:N, 


CA, 


o, 


,0,CB,CG,CD,NE,CZ,NH1,NH2 


126:N, 


CA, 


c, 


f O,CB,CG,CD,OEl,NE2 


128:N, 


CA, 


c, 


,0,CB,CG1,CG2 


129:N, 


CA, 


o t 


,0,CB,CG,CD,OEl,OE2 


130:N, 


CA, 


o t 


r O,CB,CG,ODl,OD2 


131:N, 


CA, 


o t 


,0,CB 


135:N, 


CA, 


0 


,0,CB,CG,ND1,CD2,CE1,NE2 


137:N, 


CA, 


c 


r O,CB,CG,ODl,OD2 


138:N, 


CA, 


c 


,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
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TIB: 


:THR 


186 


:N, 


CA, 


C,0, 


CB,0G1,CG2 


TIB: 


:GLN 


188 


:N, 


CA, 


C,0, 


CB,CG,CD,0E1,NE2 


TIB: 


:THR 


192 


:N, 


CA, 


C,0, 


CB , 0G1 , CG2 


TIB: 


:LEU 


193 


:N, 


CA, 


c,o, 


CB,CG,CD1,CD2 


TIB: 


: TYR 


194 , 


:N r 


CA, 


c,o, 


CB,CG / CD1,CD2,CE1, 


TIB: 


ARG 


195 


:N, 


CA, 


c,o, 


CB,CG,CD,NE,CZ,NH1 


TIB: 


:TYR 


213 


;N, 


CA, 


c,o, 


CB , CG , CD1 , CD2 , CE1 , 


TIB: 


:SER 


217 


:N, 


CA, 


c,o, 


CB,0G 


TIB: 


:PRO 


218 


:N, 


CA, 


CD,C,0,CB,CG 


TIB: 


;GLU 


219: 


:N, 


CA, 


C,0, 


CB / CG,CD,0E1,0E2 


TIB: 


;LYS 


223: 


:N, 


CA, 


C,0, 


CB,CG,CD,CE,NZ 


TIB: 


:SER 


224, 


;N, 


CA, 


c,o, 


CB,OG 


TIB: 


:VAL 


230: 


:N, 


CA, 


c,o, 


CB,CG1,CG2 


TIB: 


; THR 


231: 


:N, 


CA, 


c,o, 


CB,0G1,CG2 


TIB: 


:ASP 


234: 


:N, 


CA, 


c,o, 


CB,CG,0D1,0D2 


TIB: 


:ILE 


235: 


;N, 


CA, 


c,o, 


CB,CG1,CG2,CD1 


TIB: 


:ILE 


238 


:N, 


CA, 


c,o, 


CB,CG1,CG2,CD1 


TIB: 


:GLU 


239 


:N, 


CA, 


c,o, 


CB,CG,CD,0E1,0E2 


TIB: 


tGLY 


240 


:N, 


CA, 


C,0 




TIB: 


tGLY 


245 


:N, 


CA, 


C,0 




TIB: 


:LEU 


269 


:N, 


CA, 


c,o, 


CB,OXT,CG,CDl,CD2 



10 



15 



20 



Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

25 ^6:17,21,80-87,89-90,113,143-153,170-176,196-206,221- 
222,226,246-249, 
TIB: 251-261, 263-267 
actsiteatom. list 
Subset ACTSITE: 



30 



35 



40 



45 



50 



55 



TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 
TIB 



PHE 
ARG 



SER 17:N,CA,C,0,CB / OG 

TYR 2 1 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
80:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
81:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
GLY 82:N,CA,C,0 
SER 83:N,CA,C,0,CB,OG 

ARG 84:N,CA,C,0,CB,CG / CD,NE,CZ,NH1,NH2 

SER 85:N,CA,C,0,CB,OG 

ILE 86:N,CA,C,0,CB,CG1,CG2,CD1 

GLU 87:N,CA,C,0,CB,CG,CD,OEl,OE2 

TRP 89:N,CA,C,0,CB,CG,CDl,CD2 / NEl,CE2,CE3,CZ2 f CZ3,CH2 
ILE 90:N,CA,C,0,CB,CG1,CG2,CD1 

113 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
143:N,CA / C,0,CB,0G1,CG2 
GLY 144:N,CA,C,0 

HIS 145:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

SER 146:N,CA,C,0,CB,OG 

LEU 147:N,CA,C,0,CB,CG,CD1,CD2 

GLY 148:N,CA,C,0 

GLY 149:N,CA,C,0 

ALA 150:N 7 CA,C,O,CB 

LEU 151:N,CA, 0,0,06^,001,002 

ALA 152:N,CA,C,0,CB 

THR 153:N,CA,C,0,CB,OGl,CG2 

SER 170:N,CA,C,O,CB,OG 

TYR 1 7 1 : N , CA , C , 0 , CB , CG , CD1 , CD 2 , CE 1 , CE2 , C Z , OH 
GLY 172:N,CA,C,0 
ALA 173:N,CA,C,0,CB 



PHE 
THR 
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TIB: 


PRO 


174:N,CA f CD,C, 0, 


CB,CG 






TIB: 


ARG 


175:N,CA,C, 


0,0b, 


CG,CD,NE,CZ 


,NH1 




TIB: 


VAL 


176:N,CA,C, 


0,CB, 


CGI , CG2 






TIB: 


ILE 


196:N,CA,C, 


0,0b, 


CG1,CG2,CD1 




5 


TIB: 


THR 


197:N,CA,C # 


0,CB, 


0G1,CG2 






TIB: 


HIS 


198:N,CA,C, 


O f CB, 


CG,ND1,CD2, 


CE1, 




TIB: 


THR 


199:N,CA,C, 


0,CB, 


0G1,CG2 






TIB: 


ASN 


200:N,CA,C, 


0,0b, 


CG,0D1,ND2 






TIB: 


ASP 


201:N,CA,C # 


0,0b, 


CG,0D1,0D2 




10 


TIB: 


ILE 


202:N,CA,C, 


o,cb, 


CG1,CG2,CD1 






TIB: 


VAL 


203:N,CA,C, 


0,0b, 


CGI , CG2 






TIB: 


PRO 


204:N,CA,CD,C,0, 


CB,CG 






TIB: 


.ARG 


205:N,CA,C, 


0,0b, 


CG,CD,NE,C2 


,NH1 




TIB: 


LEU 


206:N,CA,C, 


0,0b, 


CG,CD1,CD2 




15 


TIB: 


TRP 












221: 


N,CA,C,0,CB, CG, 


CD1 , CD2 , NE1 , CE2 , 


CE3, 




TIB: 


:ILE 


222:N,CA,C, 


0,0b, 


CG1,CG2,CD1 






TIB: 


: THR 


226:N,CA,C, 


0,CB, 


0G1,CG2 






TIB: 


GLY 


246:N,CA,C, 


0 






20 


TIB: 


:ASN 


247:N,CA,C, 


O^CB, 


CG,0D1,ND2 






TIB: 


:ASN 


248:N,CA,C< 


0,0b, 


CG,0D1,ND2 






TIB: 


GLN 


249:N,CA,C, 


0,0b, 


CG,CD,OEl,NE2 




TIB: 


t ASN 


251:N,CA,C, 


0,0b, 


CG,0D1,ND2 






TIB: 


:ILE 


252:N,CA,C, 


o,cb, 


CG1,CG2,CD1 




25 


TIB: 


:PRO 


253:N,CA,CD,C,0, 


CB,CG 






TIB' 


:ASP 


254:N,CA,C, 


0,CB, 


CG,0D1,0D2 






TIB: 


:ILE 


255:N,CA,C, 


0,CB, 


CG1,CG2,CD1 






TIB 


:PRO 


256:N,CA,CD,C,0, 


CB,CG 






TIB 


: ALA 


257:N,CA,C, 


0,CB 






30 


TIB 


:HIS 


258:N,CA,C, 


0, CB, 


CG,ND1,CD2, 


CE1, 




TIB 


:LEU 


259:N,CA,C, 


0,CB, 


CG,CD1,CD2 






TIB 


:TRP 












260: 


:N,CA,C,0,CB,CG, 


CD1,CD2,NE1,CE2, 


CE3, 




TIB 


:TYR 


261:N,CA,C, 


,0,CB, 


CG,CD1,CD2, 


CE1, 


35 


TIB 


:GLY 


263:N,CA,C 


r 0 








TIB 


:LEU 


264:N / CA / C 


,0,0b, 


CG,CD1,CD2 






TIB 


:ILE 


265:N / CA / C 


fO,CB, 


CG1,CG2,CD1 






TIB 


:GLY 


266:N,CA,C 


f o 








TIB 


:THR 


267:N,CA,C 


r O,CB, 


061/CG2 




40 


Subset RESTX: 









restxmole. list 
Subset RESTX: 

NEWMODEL: 14, 16, 18-20, 31-34 ,36, 38, 40, 48-50, 56-66, 68,78- 
79 88 91—93 

45 NEWMODEL: 104-106, 120, 136, 225, 227-229, 250, 262, 268 
restxatom. list 
Subset RESTX: 

NEWMODEL : ALA 14:N,CA,C,0,CB 

NEWMODEL: TYR 16:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
50 NEWMODEL : ALA 18:N,CA,C,0,CB 

NEWMODEL : ALA 19:N,CA,C,0,CB 

NEWMODEL : ALA 20:N,CA,C,0,CB 

NEWMODEL : GLY 31:N,CA,C,0 

NEWMODEL : THR 32 :N,CA, 0,0,06,001,002 
55 NEWMODEL : ASN 33 :N,CA,C,0,CB,CG,0D1,ND2 

NEWMODEL: ILE 34 : N, OA, C , O , CB , CGI , CG2 , CD1 

NEWMODEL: CYS 36:N,CA,C,0,CB,SG 
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10 



15 



20 



25 



30 



35 



NEWMODEL: 


GLY 


38: 


NEWMODEL: 


ALA 


40: 


NEWMODEL: 


ASP 


48: 


NEWMODEL: 


ALA 


49: 


NEWMODEL: 


THR 


50: 


NEWMODEL: 


GLU 


56: 


NEWMODEL: 


ASP 


57: 


NEWMODEL: 


SER 


58: 


NEWMODEL: 


:GLY 


59: 


NEWMODEL: 


:VAL 


60: 


NEWMODEL: 


.GLY 


61: 


NEWMODEL: 


:ASP 


62: 


NEWMODEL: 


>VAL 


63: 


NEWMODEL: 


:THR 


64: 


NEWMODEL: 


:GLY 


65: 


NEWMODEL: 


:PHE 


66: 


NEWMODEL: 


: ALA 


68: 


NEWMODEL: 


:LEU 


78: 


NEWMODEL: 


:SER 


79: 


NEWMODEL: 


:ASN 


88: 


NEWMODEL: 


:GLY 


91: 


NEWMODEL: 


:ASN 


92: 


NEWMODEL: 


:LEU 


93: 


NEWMODEL: 


:CYS 


104 


NEWMODEL * 


:SER 


105 


NEWMODEL: 


:GLY 


106 


NEWMODEL: 


:VAL 


120 


NEWMODEL 


:PRO 


136 


NEWMODEL 


:GLY 


225 


NEWMODEL 


: LEU 


227 


NEWMODEL 


:VAL 


228 


NEWMODEL 


:PRO 


229 


NEWMODEL 


:PRO 


250 


NEWMODEL 


:PHE 


262 


NEWMODEL 


:CYS 


268 



N,CA, 


c, 


0 




N,CA # 


c, 


0,CB 




N,CA, 


c, 


0,CB, 


CG,ODl,OD2 


N,CA, 


c, 


0,CB 




N,CA, 


c, 


0,CB, 


0G1,CG2 


N,CA, 




0,0b, 


CG,CD,0E1,0E2 


N, CA, 


c, 


0,CB, 


CG,ODl,OD2 


N,CA, 


c, 


,0,0b, 


OG 


N,CA, 


c. 


,0 




N,CA, 




,0,CB, 


CG1,CG2 


N,CA< 




,0 




N,CA, 




,0,CB, 


CG,ODl,OD2 


N,CA, 


>c t 


,0,CB, 


CG1,CG2 


N,CA, 


>c. 


,0,0b, 


0G1,CG2 


N,CA, 




,0 




N^A, 


rC, 


,0,CB, 


CG , CD1 , CD2 , CE1 , CE2 , CZ 


N,CA, 




,0,CB 




N,CA, 


rC, 


t 0 , CB , 


CG,CD1,CD2 


N,CA, 


rC, 


,0,CB, 


OG 


N,CA, 


>c, 


rO, CB, 


CG,0D1,ND2 


N,CA, 


>c, 


,0 




N,CA< 


r c, 


rO,CB, 


CG,0D1,ND2 


N,CA, 


r c, 


r O,CB, 


CG,CD1,CD2 



,CE2,CZ 



Example 10 

Providing a lipase variant E87K+D254K 

The Humicola lanuginosa lipase variant E87K+D254K was 
40 constructed, expressed and purified as described in WO 
92/05249. 

Example 11 

Lipase-S-PEG 15,000 conjugate 
45 The lipase variant E87K+D254K-SPEG conjugate was prepared as 
described in Example 7, except that the enzyme is the Humicola 
lanuginosa lipase variant (E87K+D254K) described in Example 10 
and the polymer is mPEG15,000. 



50 Example 12 
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Immunoaenecitv assessed as IgGj^ of lipase vari ant rD87K+D254iO in 
Balb/C mice 

Balb/c mice were immunized by subcutanuous injection of: 

i) 50 fil 0.9% (wt/vol) NaCl solution (control group, 8 mice) 
5 (control) , 

ii) 50^1 0.9% (wt/vol) NaCl solution containing 25 ng of protein 
of a Humicola lanuginosa lipase variant (E87K+D254K) (group 1, 

8 mice) (unmodified lipase variant), 

iii) 50% 0.9% (wt/vol) NaCl solution containing a Humicola 

10 lanugoinosa lipase variant substituted in position D87K+D254K and 

coupled to a N-succinimidyl carbonate activated mPEG 15, 000 (group 

2, 8 mice) -(lipase-SPEGlS* 000) . 

The amount of protein for each batch was measured by optical 

density measurements. Blood samples (200 |al) were collected 
15 from the eyes one week after the immunization, but before the 

following immunization. Serum was obtained by blood clothing, 

and centrifugation. 

The IgGi response was determined by use of the Balb/C mice 

Igd EL ISA method as described above. 
20 Results; 

Five weekly immunizations were required to elicit a 
detectable humoral response to the unmodified Humicola 
lanuginosa variant. The antibody titers elicited by the 
conjugate (i.e. lipase-SPEGIS, 000 ranged between 960 and 1920, 
25 and were only 2 to 4x lower than the antibody titer of 3840 
that was elicited by unmodified HL8.2-Lipolase (figure to the 
left) . 

The results of the tests are shown in Figure 1 

As will be apparent to those skilled in the art, in the light 
30 of the foregoing disclosure, many alterations and modifications 
are possible in the practice of this invention without departing 
from the spirit or scope thereof. Accordingly, the scope of the 
invention is to be construed in accordance with the substance 
defined by the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
5 (A) NAME: Novo Nordisk A/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsveard 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2880 
10 (G) TELEPHONE: +45 4444 8888 

<H) TELEFAX: +45 4449 3256 

(ii) TITLE OF INVENTION: A modified polypeptide 

(iii) NUMBER OF SEQUENCES: 9 
(iv) COMPUTER READABLE FORM: 

15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
<C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1.30 (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus sp. PD498, NCIMB No. 40484 
(ix) FEATURE: 

3 0 (A) NAME /KEY: CDS 

(B) LOCATION: 1. .840 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TGG TCA CCG AAT GAC CCT TAC TAT TCT GCT TAC CAG TAT GGA CCA CAA 48 
35 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
15 10 15 

AAC ACC TCA ACC CCT GCT GCC TGG GAT GTA ACC CGT GGA AGC AGC ACT 96 
Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 

4 0 20 25 30 

CAA ACG GTG GCG GTC CTT GAT TCC GGA GTG GAT TAT AAC CAC CCT GAT 144 
Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



45 



CTT GCA AGA AAA GTA ATA AAA GGG TAC GAC TTT ATC GAC AGG GAC AAT 192 
Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
50 55 60 



50 AAC CCA ATG GAT CTT AAC GGA CAT GGT ACC CAT GTT GCC GGT ACT GTT 240 
Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

GCT GCT GAT ACG AAC AAT GGA ATT GGC GTA GCC GGT ATG GCA CCA GAT 288 
55 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 

85 90 95 

ACG AAG ATC CTT GCC GTA CGG GTC CTT GAT GCC AAT GGA AGT GGC TCA 336 
Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
60 * 100 105 HO 

CTT GAC AGC ATT GCC TCA GGT ATC CGC TAT GCT GCT GAT CAA GGG GCA 384 
Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 



65 



AAG GTA CTC AAC CTC TCC CTT GGT TGC GAA TGC AAC TCC ACA ACT CTT 432 
Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
130 135 140 



70 AAG AGT GCC GTC GAC TAT GCA TGG AAC AAA GGA GCT GTA GTC GTT GCT 



480 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

GCT GCA GGG AAT GAC AAT GTA TCC CGT ACA TTC CAA CCA GCT TCT TAC 528 
5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

CCT AAT GCC ATT GCA GTA GGT GCC ATT GAC TCC AAT GAT CGA AAA GCA 576 
Pro Asn Ala lie Ala Val Gly Ala lie Asp Ser Asn Asp Arg Lys Ala 
10 180 - 185 190 

TCA TTC TCC AAT TAC GGA ACG TGG GTG GAT GTC ACT GCT CCA GGT GTG 624 
Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 



15 



AAC ATA GCA TCA ACC GTT CCG AAT AAT GGC TAC TCC TAC ATG TCT GGT 672 
Asn lie Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
210 215 220 



2 0 ACG TCC ATG GCA TCC CCT CAC GTG GCC GGT TTG GCT GCT TTG TTG GCA 720 
Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

AGT CAA GGT AAG AAT AAC GTA CAA ATC CGC CAG GCC ATT GAG CAA ACC 768 
25 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

GCC GAT AAG ATC TCT GGC ACT GGA ACA AAC TTC AAG TAT GGT AAA ATC 816 
Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
30 260 265 270 

AAC TCA AAC AAA GCT GTA AGA TAC 840 
Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

35 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 280 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

45 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
1 5 10 15 

Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
20 25 30 



50 



Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
55 50 55 60 

Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

60 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 

85 90 95 

Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
100 105 HO 

65 

Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 



Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
70 130 135 140 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

Pro Asn Ala He Ala Val Gly Ala He Asp Ser Asn Asp Arg Lys Ala 
180 185 190 

10 

Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 

Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
15 210 215 220 

Thr Ser Met Ala Ser Pro H1b Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

20 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
260 265 270 

25 

Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

(2) INFORMATION FOR SEQ ID NO: 3: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
35 (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

40 Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

1 5 10 15 



45 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 50 ~ 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

55 Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 



60 



Ser Gly Ser Gly Ser Val Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
65 130 135 ,140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Gly Ser He Ser 
145 150 155 160 

70 Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Arthromyces ramosus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Gin Gly Pro Gly Gly Gly Gly Gly Ser Val Thr Cys Pro Gly Gly Gin 
1 5 * 10 15 

Ser Thr Ser Asn Ser Gin Cys Cys Val Trp Phe Asp Val Leu Asp Asp 
20 25 30 

Leu Gin Thr Asn Phe Tyr Gin Gly Ser Lys Cys Glu Ser Pro Val Arg 
35 40 45 

Lys He Leu Arg He Val Phe His Asp Ala He Gly Phe Ser Pro Ala 
50 55 60 

Leu Thr Ala Ala Gly Gin Phe Gly Gly Gly Gly Ala Asp Gly Ser He 
65 70 75 80 

He Ala His Ser Asn He Glu Leu Ala Phe Pro Ala Asn Gly Gly Leu 
85 90 95 

Thr Asp Thr He Glu Ala Leu Arg Ala Val Gly He Asn His Gly Val 
100 105 HO 

Ser Phe Gly Asp Leu He Gin Phe Ala Thr Ala Val Gly Met Ser Asn 
115 120 125 

Cys Pro Gly Ser Pro Arg Leu Glu Phe Leu Thr Gly Arg Ser Asn Ser 
130 135 140 

Ser Gin Pro Ser Pro Pro Ser Leu He Pro Gly Pro Gly Asn Thr Val 
145 150 155 160 

Thr Ala He Leu Asp Arg Met Gly Asp Ala Gly Phe Ser Pro Asp Glu 
165 170 175 

Val Val Asp Leu Leu Ala Ala His Ser Leu Ala Ser Gin Glu Gly Leu 
180 185 190 

Asn Ser Ala He Phe Arg Ser Pro Leu Asp Ser Thr Pro Gin Val Phe 
195 200 205 
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Asp Thr Gin Phe Tyr lie Glu Thr Leu Leu Lys Gly Thr Thr Gin Pro 
210 215 220 

Gly Pro Ser Leu Gly Phe Ala Glu Glu Leu Ser Pro Phe Pro Gly Glu 
5 225 230 235 240 

Phe Arg Met Arg Ser Asp Ala Leu Leu Ala Arg Asp Ser Arg Thr Ala 
245 250 255 

10 Cys Arg Trp Gin Ser Met Thr Ser Ser Asn Glu Val Met Gly Gin Arg 

260 265 270 



15 



Tyr Arg Ala Ala Met Ala Lys Met Ser Val Leu Gly Phe Asp Arg Asn 
275 280 285 

Ala Leu Thr Asp Cys Ser Asp Val He Pro Ser Ala Val Ser Asn Asn 
290 ~ 295 300 



Ala Ala Pro Val He Pro Gly Gly Leu Thr Val Asp Asp He Glu Val 
20 305 310 315 320 

Ser Cys Pro Ser Glu Pro Phe Pro Glu He Ala Thr Ala Ser Gly Pro 
325 330 335 

25 Leu Pro Ser Leu Ala Pro Ala Pro 

340 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (vi) ORIGINAL SOURCE: 

(B) STRAIN: Humicola lanuginosa DSM 4109 
(ix) FEATURE: 

(A) NAME /KEY: sig jpeptide 

(B) LOCATION:!. .66 
40 (ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 67. .876 
(ix) FEATURE: 

(A) NAME /KEY : CDS 
45 (B) LOCATION: 1.. 876 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AGG AGC TCC CTT GTG CTG TTC TTT GTC TCT GCG TGG ACG GCC TTG 48 
Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
50 -22 -20 -15 -10 

GCC AGT CCT ATT CGT CGA GAG GTC TCG CAG GAT CTG TTT AAC CAG TTC 96 
Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 1 5 10 



55 



AAT CTC TTT GCA CAG TAT TCT GCA GCC GCA TAC TGC GGA AAA AAC AAT 144 
Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 



60 GAT GCC CCA GCT GGT ACA AAC ATT ACG TGC ACG GGA AAT GCC TGC CCC 192 
Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

GAG GTA GAG AAG GCG GAT GCA ACG TTT CTC TAC TCG TTT GAA GAC TCT 240 
65 Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
45 50 55 

GGA GTG GGC GAT GTC ACC GGC TTC CTT GCT CTC GAC AAC ACG AAC AAA 288 
Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
70 60 65 70 
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TTG ATC GTC CTC TCT TTC CGT GGC TCT CGT TCC ATA GAG AAC TGG ATC 336 

Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

5 

GGG AAT CTT AAC TTC GAC TTG AAA GAA ATA AAT GAC ATT TGC TCC GGC 384 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

10 TGC AGG GGA CAT GAC GGC TTC ACT TCG TCC TGG AGG TCT GTA GCC GAT 432 
Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

ACG TTA AGG CAG AAG GTG GAG GAT GCT GTG AGG GAG CAT CCC GAC TAT 480 
15 Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
125 130 135 

CGC GTG GTG TTT ACC GGA CAT AGC TTG GGT GGT GCA TTG GGA ACT GTT 528 
Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
20 140 145 150 

GCC GGA GCA GAC CTG CGT GGA AAT GGG TAT GAT ATC GAC GTG TTT TCA 576 
Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 



25 



TAT GGC GCC CCC CGA GTC GGA AAC AGG GCT TTT GCA GAA TTC CTG ACC 624 
Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 



30 GTA CAG ACC GGC GGA ACA CTC TAC CGC ATT ACC CAC ACC AAT GAT ATT 672 
Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

GTC CCT AGA CTC CCG CCG CGC GAA TTC GGT TAC AGC CAT TCT AGC CCA 720 
35 Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
205 210 215 

GAG TAC TGG ATC AAA TCT GGA ACC CTT GTC CCC GTC ACC CGA AAC GAT 768 
Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
40 220 225 230 

ATC GTG AAG ATA GAA GGC ATC GAT GCC ACC GGC GGC AAT AAC CAG CCT 816 
He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 



45 



AAC ATT CCG GAT ATC CCT GCG CAC CTA TGG TAC TTC GGG TTA ATT GGG 864 
Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 



50 ACA TGT CTT TAG 876 
Thr Cys Leu * 
270 

(2) INFORMATION FOR SEQ ID NO: 6: 
55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
-22 -20 -15 -10 

65 Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 1 5 10 



Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 
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Asp Ala Pro Ala Gly Thr Asn lie Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
5 45 ~ 50 55 

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
60 65 70 

10 Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

15 

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
20 125 130 135 

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
140 145 150 

25 Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 

30 

Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
35 205 210 215 

Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
220 " 225 230 

40 He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 

Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 

45 

Thr Cys Leu * 
270 

50 (2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R28K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

60 gggatgtaac caagggaagc agcactcaaa eg 32 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R62K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 8: 
5 cgactttatc gataaggaca ataaccc 27 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R169K oligo" 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



caatgtatcc aaaacgttcc aaccagc 



27 
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Patent Claims 

1. A polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
5 polypeptide, having been modified in a manner to increase the 

number of attachment groups on the surface of the polypeptide, in 
comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
10 polypeptide, having been modified in a manner to decrease the 

number of attachment groups at or close to the functional site(s) 
of the polypeptide, in comparison to the number of attachment 
groups available on the corresponding parent polypeptide, 

2. The conjugate according to claims 1, having 1 to 25, 
15 preferably 1 to 10 additional polymeric molecules coupled to the 

surface of the polypeptide in comparison to the number of 
polymeric molecules of a conjugate prepared from the corresponding 
parent enzyme. 

3. The conjugate according to claims 1 and 2, wherein the 
20 additional attachment group (s) is (are) amino groups in the form of 

Lysine residues (s), or carboxylic groups in the form of Aspartic 
acid or Glutamic acid residues. 

4. The conjugate according to any of claims 1 to 3, wherein 
the additional attachment group (s) is (are) prepared by a 

25 conservative substitution of an amino acid residue, such as an 
Arginine to Lysine substitution. 

5. The conjugate according to claims 1 to 3, wherein the 
additional attachment group (s) is (are) prepared by a conservative 
substitution of an amino acid, such as an Aspargine to 

30 Aspartate/ Glut amate or a Glutamine to Aspartate/Glutamate 
substitution. 

6. The conjugate according to any of claims 1 to 5, wherein 
the added attachment group is located more than 5 A, preferably 8 
A, especially 10 A from the functional site. 

35 7. The conjugate according to claim 1, having 1 to 25 

preferably 1 to 10 fewer polymeric molecules coupled at or close 
to the functional site of the polypeptide in comparison to the 
number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 
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8. The conjugate according to claim 7, wherein the removed 
attachment group (s) is (are) amino groups in the form of Lysine 
residues (s) , or carboxylic groups in the form of Aspartic acid or 
Glutamic acid residues. 
5 9. The conjugate according to any of claims 7 and 8, wherein 

the removed attachment group (s) is (are) prepared by a conservative 
substitution of an amino group, such as Lysine to Arginine 
substitution. 

10. The conjugate according to any of claims 7 to 8, wherein 
10 the removed attachment group (s) is (are) prepared by a conservative 

substitution of a carboxylic group, such as an Aspartate /Glutamate 
to Aspargine or Aspartate/Glutamate to a Glutamine substitution. 

11. The conjugate according to any of claims 1 to 10, wherein 
the removed attachment group is located within 5 A, preferably 8 

15 A, especially 10 A from the functional site. 

12. The conjugate according to any of claims 1 to 11, wherein 
the attachment groups are broadly spread. 

13. The conjugates according to claims 1 to 12, wherein the 
parent polypeptide moiety of the conjugate has a molecular weight 

20 from 1 to 100 kDa, preferred 15 to 100 kDa. 

14. The conjugate according to claim 13, wherein the parent 
polypeptide moiety of the conjugate has a molecular weight of from 
1 to 35 kDa. 

15. The conjugates according to claim 14, wherein the parent 
25 polypeptide is an enzyme selected from the group of 
Oxidoreductases, including laccases and Superoxide dismutase 
(SOD); Hydrolases, including proteases, especially subtilisins, 
and lipolytic enzymes; Transferases, including Transglutaminases 
(TGases) ; Isomerases, including Protein disulfide Isomerases 
30 (PDI) . 

16. The conjugate according to claim 15, wherein the parent 
enzyme is PD498, Savinase®, BPN" , Proteinase K, Proteinase R, 
Subtilisin DY, Lion Y, Rennilase®, JA16, Alcalase® or a Humicola 
lanuginosa lipase, such as Lipolase®. 
35 17. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a PD498 variant with one or more of the 
following substitutions: R51K, R62K, R121K, R169K, R250K, R28K, 
R190K, P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, 
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G87K, I88K, N209K, A211K, N216K, N217K, G218K r Y219K, S220K, 
Y221K, G262K. 

18. The conjugate according to claim 17, with one of the 
following mutations: R28K+R62K, R28K+R169K, R62K + R169K, 
5 R28K+R69K+R169K. 

19. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Savinase® variant with one or more of 
the following substitutions: R10K, R19K, R45K, R145K, R170K, 
R186K, R247K, K94R, P5K, P14K, T22K, T38K, H39K, P40K, L42K, 

10 L75K, N76K, L82K, P86K, S103K, V104K, S105K, A108K, A133K, 
T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, G211K, S212K, 
T213K, A215K, S216K, N269K. 

20. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Humicola lanuginosa lipase variant 

15 with one or more of the following substitutions: 

R133K / R139K f R160K r R179K / R209K,R118K,R125K / A18K / G31K / T32K, 
N33K / G38K / A40K / D48K,T50K / E56K,D57K f S58K f G59K,V60K / G61K / D62K # 
T64K / L78K # E87K,N88K f G91K f N92K,L93K f S105K f G106K / V120K / P136K / G225 
K,L227K,V228K,P229K,P250K,D254K,F262K. 

20 21. The conjugate according to claim 20 with the following 

mutations E87K+D254K. 

22. The conjugate according to any of claims 1 to 21, wherein 
the polymeric molecules coupled to the polypeptide have a 
molecular weight from 1 to 60 kDa, especially 1-35 kDa, especially 

25 3 to 25 kDa. 

23. The conjugate according to claim 22, wherein the poly- 
meric molecule is selected from the group comprising a natural or 
synthetic homo- and heteropolymers, selected from the group of the 
synthetic polymeric molecules including Branched PEGs, poly-vinyl 

30 alcohol (PVA) , poly-carboxyl acids, poly-(vinylpyrolidone) and 
poly-D,L-amino acids, or natural occurring polymeric molecules 
including dextrans, including carboxymethyl-dextrans, and 
celluloses such as methylcellulose, carboxymethylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, and 

35 hydrolysates of chitosan, starches, such as hydroxyethyl-starches, 
hydroxypropyl-starches, glycogen, agarose, guar gum, inulin, 
pullulans, xanthan gums, carrageenin, pectin and alginic acid. 

24. A method for preparing improved polypeptide-polymer 
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conjugates comprising the steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
5 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
10 selected in step b) at or close to the functional site, 

d) coupling polymeric molecules to the mutated polypeptide. 

25. The method according to claim 24, wherein the 
identification of amino acid residues located on the surface on 
the polypeptide referred to in step a) are performed by a computer 

15 program analyzing the 3D structure of the parent polypeptide in 
question. 

26. The method according to claim 24, wherein step b) 
comprises selecting Arginine or Lysine residues on the surface of 
the parent polypeptide. 

20 27. The method according to claim 24, wherein one or more 

Arginine residues identified in step b) is (are) substituted with a 
Lysine residue (s) in step c) . 

28. The method according to claims 27, wherein the 
substituted Arginine residues have a distance of more than 5 A, 

25 preferably 8 A , especially 10 A from the functional site. 

29. The method according to any of claims 24 to 28, wherein 
the polypeptide prepared in step . c) is coupled to polymeric 
molecules. 

30. Use of the conjugate in claims 1 to 23 for reducing the 
30 allergenicity of industrial products. 

31. Use of the conjugate in claims 1 to 23 for reducing the 
immunogenicity of pharmaceuticals. 

32. A composition comprising a conjugate of any of claims 1 
to 23 and further comprising ingredients used in industrial 

35 products. 

33. The composition according to claim 32, wherein the 
industrial product is a detergent, such as a laundry, dish wash or 
hard surface cleaning product, or a food or feed product. 
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34. The composition according to claim 32, comprising a 
conjugate of any of claims 1 to 22 and further ingredients used in 
skin care products. 

35. A composition comprising a conjugate of any of claims 1 
5 to 23 and further comprising ingredients used in pharmaceuticals. 
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